Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
GEM benchmark
https://gem-benchmark.com
Activity Feed
Request to join this org
Follow
108
AI & ML interests
We develop infrastructure for the evaluation of generated text.
Recent Activity
gentaiscool
authored
a paper
1 day ago
Language Surgery in Multilingual Large Language Models
1024m
authored
a paper
26 days ago
Uncovering Cultural Representation Disparities in Vision-Language Models
Krystalan
authored
a paper
about 1 month ago
ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement Learning
View all activity
Team members
94
+60
+47
+26
+16
GEM
's Spaces
3
Sort: Recently updated
Runtime error
9
DatasetCardForm
👁
Runtime error
3
Gem Submissions
💎
Running
3
Gem Results
📊