Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kelly Chiu's picture
1 4

Kelly Chiu PRO

kellycyy
shuyuej's profile picture 21world's profile picture
·

AI & ML interests

None yet

Organizations

Ai2's profile picture University of Washington's profile picture CulturalTeaming's profile picture MoralDilemmas's profile picture

Collections 1

CulturalBench
A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs
  • CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

    Paper • 2410.02677 • Published Oct 3, 2024

Papers 1

arxiv:2410.02677

spaces 1

Running

CulturalBench

🔥

Display leaderboard for model evaluation

Oct 14, 2024

models 0

None public yet

datasets 4

kellycyy/daily_dilemmas

Viewer • Updated Oct 15, 2024 • 17.7k • 98 • 3

kellycyy/CulturalBench

Viewer • Updated Oct 14, 2024 • 6.14k • 371 • 4

kellycyy/wildentities_classify

Viewer • Updated May 29, 2024 • 8.61k • 8

kellycyy/wildchat-factual-classify

Viewer • Updated May 6, 2024 • 8.53k • 16
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs