@FinancialSupport and I just released a new version of the Italian LLMs leaderboard https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard using the super useful demo-leaderboard template from @clefourrier.
We’ve evaluated over 50 models (base, merged, fine-tuned, etc.) from:
- Major companies like Meta, Mistral, Google ...
- University groups such as sapienzanlp or swap-uniba
- Italian companies like MoxoffSpA, FairMind or raicrits
- Various communities and individuals
All models were tested on #Italian benchmarks #mmlu #arc-c #hellaswag, which we contributed to the open-source lm-evaluation-harness library from EleutherAI.
Plus, you can now submit your model for automatic evaluation, thanks to compute sponsored by seeweb.
Curious about the top Italian models? Check out the leaderboard and submit your model!