Towards NLP(@towards_nlp). MTEB: Massive Text Embedding Benchmark Indeed massive work of comparison of 33 models on 56 dataset

MTEB: Massive Text Embedding Benchmark Indeed massive work of comparison of 33 models on 56 datasets and 112 languages💪 Now, if you are interested in some task, you can go to this leaderbord and orient to the best models for this task in specific language. Or, if you have new model, you can perform more clear and fair comparison. Paper: https://arxiv.org/abs/2210.07316 (useful to read more details about the tasks, abbreviations, details of the datasets and the models) Github: https://github.com/embeddings-benchmark/mteb Leaderboard at 🤗: https://huggingface.co/spaces/mteb/leaderboard