ILSP Greek Evaluation Suite

updated Jun 18

A collection of test sets for evaluating the Greek generation and understanding capabilities of base and chat LLMs (including VLMs).

  • ilsp/mmlu_greek

    Viewer • Updated May 20, 2024 • 31.7k • 1.19k • 4

  • ilsp/medical_mcqa_greek

    Viewer • Updated Sep 9, 2024 • 2.03k • 62 • 3

  • ilsp/mcqa_greek_asep

    Viewer • Updated Jun 27 • 1.2k • 38 • 3

  • ilsp/arc_greek

    Viewer • Updated Jun 7, 2024 • 7.78k • 108 • 3

  • ilsp/winogrande_greek

    Viewer • Updated Mar 7, 2024 • 41.7k • 17 • 1

  • ilsp/truthful_qa_greek

    Viewer • Updated Mar 17, 2024 • 1.63k • 97 • 2

  • ilsp/hellaswag_greek

    Viewer • Updated Apr 9, 2024 • 59.8k • 90 • 4

  • ilsp/ancient-modern_greek_translations

    Viewer • Updated Feb 27 • 100 • 89 • 2

  • ilsp/ifeval_greek

    Viewer • Updated Jun 2 • 541 • 97

  • ilsp/mt-bench-greek

    Viewer • Updated Jun 2 • 80 • 87

  • ilsp/m-ArenaHard_greek

    Viewer • Updated Jun 2 • 1k • 21 • 1

  • ilsp/mgsm_greek

    Viewer • Updated Oct 11, 2024 • 258 • 103

  • ilsp/MMLU-Pro_greek

    Viewer • Updated Aug 6, 2024 • 12k • 51 • 3

  • ilsp/vibeeval_greek

    Viewer • Updated Jan 20 • 269 • 10