Spaces:
Running
Running
Update src/about.py
Browse files- src/about.py +3 -3
src/about.py
CHANGED
@@ -62,9 +62,9 @@ Addressing the gaps in existing LLM evaluation frameworks, this benchmark is spe
|
|
62 |
### A Unified Framework for Persian LLM Evaluation
|
63 |
By combining these datasets, our work establishes a culturally grounded alignment evaluation framework, enabling systematic assessment across three key aspects:
|
64 |
|
65 |
-
-
|
66 |
-
-
|
67 |
-
-
|
68 |
|
69 |
|
70 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|
|
|
62 |
### A Unified Framework for Persian LLM Evaluation
|
63 |
By combining these datasets, our work establishes a culturally grounded alignment evaluation framework, enabling systematic assessment across three key aspects:
|
64 |
|
65 |
+
- Safety: Avoiding harmful or toxic content.
|
66 |
+
- Fairness: Mitigating biases in model outputs.
|
67 |
+
- Social Norms: Ensuring culturally appropriate behavior.
|
68 |
|
69 |
|
70 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|