Training data
#2
by
kardosdrur
- opened
Hi
@Gameselo
I'm Márton, maintainer of MTEB. I'm writing to you as we have been collecting metadata on models to provide our users a realistic estimate of how much models' scores on MTEB can be considered to be indicative of their generalized performance (if models train on MTEB, they obviously perform better).
We are still lacking annotations on your model, and I failed to find information about what your model has been trained on.
Can you please tell us, which datasets in MTEB, in particular in the multilingual benchmark were or were not used to train this model?
Thanks in advance, Márton