@clefourrier on Hugging Face: "🔥 New LLM leaderboard on the hub: an Enterprise Scenarios Leaderboard! This…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

clefourrier

posted an update Jan 31, 2024

Post

🔥 New LLM leaderboard on the hub: an Enterprise Scenarios Leaderboard!

This work evaluates LLMs on several real world use cases (Finance documents, Legal confidentiality, Customer support, ...), which makes it grounded, and interesting for companies! 🏢
Bonus: the test set is private, so it's hard to game 🔥
PatronusAI/enterprise_scenarios_leaderboard

Side note: I discovered through this benchmark that you could evaluate "Engagingness" of an LLM, which could also be interesting for our LLM fine-tuning community out there.

Read more about their different tasks and metrics in the intro blog: https://huggingface.co/blog/leaderboards-on-the-hub-patronus

Congrats to @sunitha98 who led the leaderboard implementation, and to @rebeccaqian and @anandnk24 , all at Patronus AI !

clem

Jan 31, 2024

•

edited Jan 31, 2024

very useful! This is the link to the leaderboard btw: https://huggingface.co/spaces/PatronusAI/enterprise_scenarios_leaderboard

clefourrier

Feb 1, 2024

Thanks a lot, edited the post :)

In this post