MTEB: Massive Text Embedding Benchmark
Indeed massive work of comparison of 33 models on 56 datasets and 112 languages💪
Now, if you are interested in some task, you can go to this leaderbord and orient to the best models for this task in specific language. Or, if you have new model, you can perform more clear and fair comparison.
Paper: https://arxiv.org/abs/2210.07316(useful to read more details about the tasks, abbreviations, details of the datasets and the models)
Github: https://github.com/embeddings-benchmark/mteb
Leaderboard at 🤗: https://huggingface.co/spaces/mteb/leaderboard