Mteb

Latest version: v1.36.22

Safety actively analyzes 714815 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 53 of 82

1.11.17

Fix

* fix: Convert BigPatent to Fast (813)

* Convert BigPatentClustering to fast, fill metadata

* Fix num samples ([`9b85380`](https://github.com/embeddings-benchmark/mteb/commit/9b85380beaf0395363b4c67c6b056f067da11024))

* fix: Fixes MultilabelClassification eval_split (812)

* fixes MultilabelClassification
* added MalteseNewsClassification to test_mteb
* points added ([`9b602ad`](https://github.com/embeddings-benchmark/mteb/commit/9b602ade56d6be9018aac74a5addc8c3fd18c43a))

* fix: Convert Biorxiv and Medrxiv clustering to fast (788)

* Rework biorxiv and medrxiv data processing scripts
- AbsTaskClusteringFast format for biorxiv
- AbsTaskClusteringFast format for medrxiv
- deduplication
* Add AbsTaskClusteringFast versions of the Biorxiv and Medrxiv tasks
- fast bio p2p
- fast bio s2s
- fast med p2p
- fast med s2s
- linting
- fix metadata
Bio/medrxiv: add metadata to old tasks, naming scheme for fast tasks, revision hashes
* Add results for biorxiv and medrxiv tasks
rerun tasks
* Add points
update points
---------
Co-authored-by: supplyandcommand <42962106+supplyandcommandusers.noreply.github.com>
Co-authored-by: Isaac Chung <chungisaac1217gmail.com> ([`13147aa`](https://github.com/embeddings-benchmark/mteb/commit/13147aa1ebf196007d7a6bf6c3f3d384fa55b8a2))

Unknown

* Update tasks table ([`8b88a25`](https://github.com/embeddings-benchmark/mteb/commit/8b88a25e33d68ca5f357c8ec405d2cfe3852e578))

* Update points table ([`de25749`](https://github.com/embeddings-benchmark/mteb/commit/de257498203a2a65eb8512053f1e969c1adf4eb8))

* Update points table ([`07dce3b`](https://github.com/embeddings-benchmark/mteb/commit/07dce3b78c7d6424961e7f92fe4b4844823e0a92))

* Update tasks table ([`d508c82`](https://github.com/embeddings-benchmark/mteb/commit/d508c82682991245d58e74c9acf00c06388727f2))

* update reranking Fr tasks (811)

* update tasks

* fix missing import

* fix dates

* apply linter

* add call to dataset_transform

* remove stratified subsampling

* apply lint

* update results

* add multilingual-e5-small results

* update mteb version

---------

Co-authored-by: Imene Kerboua <imenelydia.krgmail.com> ([`7980167`](https://github.com/embeddings-benchmark/mteb/commit/798016747f6d5d19e65fae6a5d51b1a3ffd30fdf))

* Update tasks table ([`1e69ed2`](https://github.com/embeddings-benchmark/mteb/commit/1e69ed287422a02f8bdd8b7569babb6489b04ca0))

* Update points table ([`264a4c8`](https://github.com/embeddings-benchmark/mteb/commit/264a4c8d3754b96734f95a9e4957c1df885f1931))

* Multilabel Brazilian Toxic Tweets Classification (773)

* BrazilianToxic Tweets multilabel classification

* minor maltese news clf fixes

* BrazilianToxicTweetsClassification improvements

* BrazilianToxicTweetsClassification cleanup

* Update mteb/tasks/MultiLabelClassification/por/BrazilianToxicTweetsClassification.py

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com>

* points added

---------

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com> ([`5f0cd32`](https://github.com/embeddings-benchmark/mteb/commit/5f0cd32f922e092bfe5f06842b1af0bf6fbaaa1e))

1.11.16

Fix

* fix: Speed up Reranking tasks (793)

* Speed up MindSmallReranking

* Fix typo

* Update docs/mmteb/points/792.jsonl

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com>

* Fix points files

---------

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com> ([`9707621`](https://github.com/embeddings-benchmark/mteb/commit/970762142550d8ba4b58de506b90dc45043935ea))

Unknown

* Update points table ([`4fd87f7`](https://github.com/embeddings-benchmark/mteb/commit/4fd87f779b8515be41a5c7ab682b56acbd658167))

1.11.15

Fix

* fix: Converted VG to hierarchical (694)

* Added hierarchical VG clustering tasks

* Added startified subsampling for multilabel tasks to AbsTask

* Added stratified subsampling to VG clustering

* Fixed stratified subsampling for multilabel tasks

* fix: Converted VG to AbsTaskClusteringFast

* Added results for paraphrase model

* Removed debugging print statements

* Added &39;not specified&39; license to VGHierarchical

* Added proper license from Norsk Aviskorpus

* Ran linting

* Replaced stratification with just regular subsampling

* fix: fixed subsampling

* Added results for VG

* Added points

* fix: Fixed JSON in 694.jsonl

---------

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsengmail.com> ([`ece878e`](https://github.com/embeddings-benchmark/mteb/commit/ece878eb274c4d72084d4488367e944c7db99fe1))

Unknown

* Update tasks table ([`d813be0`](https://github.com/embeddings-benchmark/mteb/commit/d813be0c0a9e5bf2c60c5540ac2421005171dcf0))

* Update points table ([`418a2b1`](https://github.com/embeddings-benchmark/mteb/commit/418a2b1df1885496b2cdd4a66f037c784b9e5911))

* Update points table ([`9a9f2e6`](https://github.com/embeddings-benchmark/mteb/commit/9a9f2e6f1f173e1ca66701d6b0f7840e8c25b768))

* Fix French retieval metrics (805)

* fix French retieval metrics

* chore: add points ([`e82ab71`](https://github.com/embeddings-benchmark/mteb/commit/e82ab710af174d4bc705f68320282fa54a3fc819))

1.11.14

Documentation

* docs: fixed points ([`774ca70`](https://github.com/embeddings-benchmark/mteb/commit/774ca70f5f51eae7b467f990bf4bc49dc771e21e))

Fix

* fix: broken dataset references (803)

* format

* updated datasets paths ([`9bbd2dd`](https://github.com/embeddings-benchmark/mteb/commit/9bbd2dd6779f36e2c75a5ae20af1f10ce613f934))

* fix: Added ArXiv Hierarchical clustering (S2S and P2P) (699)

* Added ArXiv Hierarchical clustering (S2S and P2P)

* Use dummy subsampling in ArXivHierarchical

* fix: convert iterables to list

* Added results for ArXivHierarchical

* Added points ([`396eefa`](https://github.com/embeddings-benchmark/mteb/commit/396eefa32e99c4aff372543ae2aa88be32a34381))

Unknown

* Update points table ([`266dfce`](https://github.com/embeddings-benchmark/mteb/commit/266dfce170b94d73a6e1c99c3f02c52f2d3c93cc))

* Merge branch &39;main&39; of https://github.com/embeddings-benchmark/mteb ([`6afad5b`](https://github.com/embeddings-benchmark/mteb/commit/6afad5b24f58994a6db1d812bf9414ea5cfa9af2))

* Update tasks table ([`9b79eb2`](https://github.com/embeddings-benchmark/mteb/commit/9b79eb2d61fcf669a610527f868b1696d310be48))

1.11.13

Fix

* fix: GPT4-o generated queries for 14 languages (718)

* first proper upload of wikipedia-retrieval dataset

* update license and README of dataset

* fix test split and add WikipediaRetrievalDE task

* add WikipediaRerankingDE task

* add Bengali tasks

* multilingual reranking dataset

* add Multilingual Reranking

* add Retrieval tasks

* update metadata for Reranking task

* run make lint

* fix metadata validation errors

* delete German and Bengali Reranking tasks

* fix more task metadata, tests passing now

* add retrieval results

* WIP: reranking with multilingual dataset

* undo changes to run script

* update points and contributor info

* subcall MultilingualTask for reranking task and add reranking results

* WIP: make retrieval a multilingual dataset, too

* WIP: first run of WikipediaRetrievalMultilinugal

* add WikipediaRetrievalMultilinugal task and results

* delete language specific retrieval tasks and results

* update points and add Openreview IDs

* make lint

* remove debugging print statement ([`411e232`](https://github.com/embeddings-benchmark/mteb/commit/411e232e3f696d0c3cb2921cb3e51b53d52d331c))

* fix: add Xstance and ensure valid dataset paths (795)

* fix: Added new dataset XStance pair classification (737)

* Added xstance dataset

* Adding points, fixed hard-coded data loading

* fix: ensure dataset paths a valid

---------

Co-authored-by: malteos <giti.mieo.de> ([`47eb54c`](https://github.com/embeddings-benchmark/mteb/commit/47eb54c205cea7b7fe059769347a8d8126ac32e3))

Unknown

* Update tasks table ([`2b105d7`](https://github.com/embeddings-benchmark/mteb/commit/2b105d7afd7efee3ff31abc8665c108a9d722ead))

1.11.12

Fix

* fix: update CO2 tracker attribute (791)

* fix: update CO2 tracker attribute

* chore: add points ([`bef9508`](https://github.com/embeddings-benchmark/mteb/commit/bef9508d60cb23d955b113413b167e5db9ee8e4b))

Unknown

* Update points table ([`c39cf08`](https://github.com/embeddings-benchmark/mteb/commit/c39cf08acf6adb73b86e2c6fff2026f47cd2b8b5))

Page 53 of 82

Links

Releases

Has known vulnerabilities

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.