Text embeddings are commonly evaluated on a small set of datasets from a single task not covering their possible applications to other tasks. It is unclear whether state-of-the-art embeddings on semantic textual similarity (STS) can be equally well applied to other tasks like clustering or reranking. This makes progress in the field difficult to track, as various models are constantly being proposed without proper evaluation. To solve this problem, we introduce the Massive Text Embedding Benchmark (MTEB). MTEB spans 8 embedding tasks covering a total of 58 datasets and 112 languages. Through the benchmarking of 33 models on MTEB, we establish the most comprehensive benchmark of text embeddings to date. We find that no particular text embedding method dominates across all tasks. This suggests that the field has yet to converge on a universal text embedding method and scale it up sufficiently to provide state-of-the-art results on all embedding tasks.

Features

  • Dataset selection
  • Datasets can be selected by providing the list of datasets
  • You can also specify which languages to load for multilingual/crosslingual tasks
  • You can evaluate only on test splits of all tasks
  • Use a custom model
  • Evaluate on a custom task

Project Samples

Project Activity

See All Activity >

Categories

Neural Search

License

Apache License V2.0

Follow MTEB

MTEB Web Site

You Might Also Like
Component Content Management System for Software Documentation Icon
Component Content Management System for Software Documentation

Great tool for serious technical writers

Paligo is an end-to-end Component Content Management System (CCMS) solution for technical documentation, policies and procedures, knowledge management, and more.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of MTEB!

Additional Project Details

Programming Language

Python

Related Categories

Python Neural Search Software

Registered

2023-08-21