@embeddings-benchmark

Massive Text Embedding Benchmark

MTEB is a Python framework for evaluating embeddings and retrieval systems for both text and images. MTEB covers more than 1000 languages and diverse tasks, from classics like classification and clustering to specialized use cases such as legal, code, or healthcare retrieval.

To get started with mteb, check out our documentation.

Overview
📈 Leaderboard The interactive leaderboard of the benchmark
Get Started
🏃 Get Started Overview of how to use mteb
🤖 Defining Models How to use existing models and define custom ones
📋 Selecting tasks How to select tasks, benchmarks, splits, etc.
🏭 Running Evaluation How to run evaluations, including cache management and speeding up runs
📊 Loading Results How to load and work with existing model results
Overview
📋 Tasks Overview of available tasks
📐 Benchmarks Overview of available benchmarks
🤖 Models Overview of available models
Contributing
🤖 Adding a model How to submit a model to MTEB and to the leaderboard
👩‍💻 Adding a dataset How to add a new task/dataset to MTEB
👩‍💻 Adding a benchmark How to add a new benchmark to MTEB and to the leaderboard
🤝 Contributing How to contribute to MTEB and set it up for development

Popular repositories

  1. mteb Public

    MTEB: Massive Text Embedding Benchmark

    Python · 2.9k stars · 483 forks

  2. results Public

    Data for the MTEB leaderboard

    Python · 33 stars · 91 forks

  3. leaderboard Public archive

    Code for the MTEB leaderboard

    Python · 30 stars · 15 forks

  4. arena Public

    Code for the MTEB Arena

    Python · 23 stars · 9 forks

  5. mtebpaper Public

    Resources & scripts for the paper "MTEB: Massive Text Embedding Benchmark"

    Python · 18 stars · 4 forks

  6. miebpaper Public

    Jupyter Notebook · 2 stars

Repositories

Showing 7 of 7 repositories
  • mteb Public

    MTEB: Massive Text Embedding Benchmark

    Python · 2,919 stars · Apache-2.0 · 483 forks · 247 open issues (8 issues need help) · 6 pull requests · Updated Oct 23, 2025
  • results Public

    Data for the MTEB leaderboard

    Python · 33 stars · 91 forks · 0 open issues · 1 pull request · Updated Oct 21, 2025
  • .github Public

    0 stars · 0 forks · 0 open issues · 0 pull requests · Updated Oct 16, 2025
  • arena Public

    Code for the MTEB Arena

    Python · 23 stars · 9 forks · 25 open issues · 5 pull requests · Updated Jul 2, 2025
  • miebpaper Public

    Jupyter Notebook · 2 stars · 0 forks · 0 open issues · 0 pull requests · Updated Feb 28, 2025
  • leaderboard Public archive

    Code for the MTEB leaderboard

    Python · 30 stars · 15 forks · 15 open issues · 2 pull requests · Updated Feb 4, 2025
  • mtebpaper Public

    Resources & scripts for the paper "MTEB: Massive Text Embedding Benchmark"

    Python · 18 stars · 4 forks · 2 open issues · 0 pull requests · Updated Sep 22, 2024
