Small Changes
src/about.py CHANGED (+1 -1)
@@ -91,7 +91,7 @@ TITLE = """<h1 align="center" id="space-title">🚀 EVALITA-LLM Leaderboard 🚀
 
 # What does your leaderboard evaluate?
 INTRODUCTION_TEXT = """
-Evalita-LLM
+Evalita-LLM is a benchmark designed to evaluate Large Language Models (LLMs) on Italian tasks. The distinguishing features of Evalita-LLM are the following: (i) **all tasks are native Italian**, avoiding translation issues and potential cultural biases; (ii) the benchmark includes **generative** tasks, enabling more natural interaction with LLMs; (iii) **all tasks are evaluated against multiple prompts**, this way mitigating the model sensitivity to specific prompts and allowing a fairer evaluation.
 
 **Multiple Choice**: 📊TE (Textual Entailment), 😃SA (Sentiment Analysis), ⚠️HS (Hate Speech Detection), 🏥AT (Admission Test), 🔤WIC (Word in Context), ❓FAQ (Frequently Asked Questions)
 **Generative**: 🔄LS (Lexical Substitution), 📝SU (Summarization), 🏷️NER (Named Entity Recognition), 🔗REL (Relation Extraction)
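
Note on point (iii) of the new introduction text: the diff says every task is evaluated against multiple prompts, but it does not say how the per-prompt scores are combined. The sketch below is only an illustration of one plausible way to summarize them. The function name `summarize_prompt_scores`, the prompt labels, and the score values are all hypothetical and do not come from the Evalita-LLM code.

```python
from statistics import mean, pstdev


def summarize_prompt_scores(scores_by_prompt: dict[str, float]) -> dict[str, float]:
    """Summarize one model's scores on one task across several prompts.

    Assumption: each value is a task metric in [0, 1] obtained with one
    prompt phrasing. The real leaderboard may aggregate differently.
    """
    if not scores_by_prompt:
        raise ValueError("at least one prompt score is required")
    values = list(scores_by_prompt.values())
    return {
        "mean": mean(values),      # average over prompts, less tied to one phrasing
        "best": max(values),       # score under the most favorable prompt
        "spread": pstdev(values),  # how sensitive the model is to the prompt
    }


# Hypothetical scores for the TE task under three prompt phrasings.
te_scores = {"prompt_1": 0.70, "prompt_2": 0.80, "prompt_3": 0.60}
summary = summarize_prompt_scores(te_scores)
print(summary)
```

Reporting the spread next to the mean makes prompt sensitivity visible instead of hiding it behind a single number, which is the concern the introduction text raises.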