Mteb

Latest version: v1.20.0

Safety actively analyzes 682487 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 21 of 58

1.12.42

Fix

* fix: Backward compatibility fixes for clustering (954)

* Added max_document_to_embed to all existing clustering tasks

* format ([`623d833`](https://github.com/embeddings-benchmark/mteb/commit/623d83300157921fe71bc78aa6700c85a5f45486))

1.12.41

Fix

* fix: Add MINERS Bitext retrieval benchmark (951)

* add new task
* add miners bitext mining benchmark
* Update TaskMetadata.py
* Add NollySenti
* rename metadata
* Update mteb/benchmarks.py
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com>
* Update benchmarks.py
* Update benchmarks.py
---------
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com> ([`f95b9e0`](https://github.com/embeddings-benchmark/mteb/commit/f95b9e0e17ec36272e249fbb754b7f7020727303))

Unknown

* Update points table ([`efbce71`](https://github.com/embeddings-benchmark/mteb/commit/efbce71e314fb5d97c4d08f7b546b16c7c9a2790))

* Update points table ([`5f39d55`](https://github.com/embeddings-benchmark/mteb/commit/5f39d55c1b7d92428c39e089e7f081049a1c08b5))

1.12.40

Documentation

* docs: Add point for PR 948 (950)

* add point

* add point ([`34286f2`](https://github.com/embeddings-benchmark/mteb/commit/34286f2a36d8bf11c0bab1160d38c5cae3b95461))

Fix

* fix: Compare Cluster and ClusterFast scores and speedup (892)

* first go at getting spearman corr for e5-base
* add back large
* small and large results
* v3 means downsampling by stratified subsampling + bootstrap to k=max_documents_per_cluster
* v3-1 means swapping values of max_documents_per_cluster and max_documents_to_embed
* v3-2 means increasing max_documents_per_cluster to 65536
* task-wise comparison
* use recommended syntax
* add back no-op changes
* add back no-op changes
* option c is now v2; remove all v3 variants; add back level 0 in results; add test significance&39;
* paraphrase-multilingual-MiniLM-L12-v2 results
* lint script
* cluster without fast should not have levels
* spearman on significant rank
* add more small model results
* 2x max_documents_to_embed to 4096
* max_documents_to_embed=8192
* t
* Added plots
* format
* use 32k samples for bigger cluster datasets
* use 4% n_samples and update task metadata
* make lint
* tests passing
* make lint
* add paraphrase-multilingual-mpnet-base-v2 and e5-large-v2 results
* add e5_eng_base_v2,labse,mxbai_embed_large_v1,bge_base_en_v1.5
* move plot scripts to mmteb srcipts repo
* replace use_dataset_as_is wtih max_document_to_embed and add description in docstrings
* lint
---------
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com> ([`2bb7623`](https://github.com/embeddings-benchmark/mteb/commit/2bb76239368c497efb92d5ae09a914eedd44a66d))

Unknown

* Update tasks table ([`54c5745`](https://github.com/embeddings-benchmark/mteb/commit/54c5745c3b2eb285a4cd1a10e06a516040eec6f1))

* Update points table ([`6aeaff4`](https://github.com/embeddings-benchmark/mteb/commit/6aeaff45a0d657f708677752cae0b96f0fb875a6))

1.12.39

Documentation

* docs: Added a script to extract the bibtex citations and generate the consolidated bib file (904)

* Added IndicNLP News Classificaiton

* Added IndicNLP News Classificaiton

* Added results

* Updated dataset version

* Small fixes

* Small fix

* Small fix

* Updated results

* Fix linting issues

* Added points

* Resolve conflict

* Update 610.jsonl

* Backfilled missing bibtex citations.

* Backfilled missing bibtex citations.

* Remove non-present files

* Wrote a script to scrape through the bibtex citations and create the corresponding bib file

* Added missed dependency

* Added functionality to create latex table

* Fixed linting issue

---------

Co-authored-by: Imene Kerboua <33312980+imenelydiakerusers.noreply.github.com>
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com> ([`ab23552`](https://github.com/embeddings-benchmark/mteb/commit/ab235525d39402d9e1460eb7662d96928e055927))

Fix

* fix: Add LinceMT Bitext Mining (MINERS) (948)

* add new task

* add metadata

* update init

* Add results

* Update TaskMetadata.py ([`6bd165e`](https://github.com/embeddings-benchmark/mteb/commit/6bd165e95d1fe2cc623ffa7faa79aae8201d5dc8))

* fix: Add Phinc Bitext Mining (MINERS) (947)

* add phinc

* update metadata

* add files

* add points ([`0e71110`](https://github.com/embeddings-benchmark/mteb/commit/0e711102dbe2bdef4063099ba83af99fef16839c))

* fix: pair classification inconsistency (945)

* first go at fixing PairClassification

* added transformation for consistency

* added IndicXnliPairClassification

* added IndicXnliPairClassification 2

* lint fixes

* points added ([`6660f43`](https://github.com/embeddings-benchmark/mteb/commit/6660f432bd501eb2bbdd131fe61d697ef547e755))

Unknown

* Update tasks table ([`3ca687f`](https://github.com/embeddings-benchmark/mteb/commit/3ca687ffe7be3ae08970f163174e3053379f72d2))

* Update tasks table ([`4e3d868`](https://github.com/embeddings-benchmark/mteb/commit/4e3d868e501c5ed2e95e8f9dbd5331c1389f07e4))

* Update points table ([`cbe8458`](https://github.com/embeddings-benchmark/mteb/commit/cbe8458e8b3328c4e83b2e517404fab625f34824))

* Update points table ([`bc9c094`](https://github.com/embeddings-benchmark/mteb/commit/bc9c094dc445120eb0686e4f4f50e47054cc55d7))

1.12.38

Fix

* fix: Merge miracl evaluator (906)

* start merge

* removing redundancy

* removing MIRACLevaluator

* add linting

* clean up method names

* correct arg bug.

* remove type annotation

* combine unique texts

* improve readability

* merge main

* add back main changes

* adjust for task_name changes

* add back

* lint

---------

Co-authored-by: Jordan Example <jordan.clive19gmail.com> ([`8ab4c14`](https://github.com/embeddings-benchmark/mteb/commit/8ab4c141313d65a6f9e265a156894889e3d32565))

Unknown

* Added overview figure in SVG and PNG (939)

* Added overview figure in SVG and PNG

* Added wide version

* Added centered overview and fixed svg files

* Made orange color darker on figures ([`e53821e`](https://github.com/embeddings-benchmark/mteb/commit/e53821e598790a90a55088a8b743641497dd303b))

1.12.37

Fix

* fix: Add jmteb (938)

* fix: correct label for sib200

* fix: Add JaGovFaqs and NLPJournal datasets (808)

* Add JaGovFaqs dataset

* Add NLPJournal datasets

* Add JAQKET dataset

* Add points

* Fix metadata

* Remove title from corpus for JAQKET dataset

* Update JAQKET scores (without title)

* Exclude JAQKET dataset

* Add points for review

---------

Co-authored-by: Ashwin Mathur <97467100+awinmlusers.noreply.github.com> ([`f38c79b`](https://github.com/embeddings-benchmark/mteb/commit/f38c79b33eb3d306a557c56234b43601a4307ffc))

Unknown

* Update tasks table ([`980dfa8`](https://github.com/embeddings-benchmark/mteb/commit/980dfa874a9fcf2ee118187d48fc944a510c968a))

* Update points table ([`a104512`](https://github.com/embeddings-benchmark/mteb/commit/a10451249a2528ade57ad9610e08051be97957cc))

Page 21 of 58

Links

Releases

Has known vulnerabilities

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.