cot_encyclopedia_human_eval
AI & ML interests
None defined yet.
Recent Activity
seungone authored a paper 15 days ago
M-Prometheus: A Suite of Open Multilingual LLM Judges
seungone authored a paper 15 days ago
Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators
seungone authored a paper 4 months ago
LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation
Team members: 7
Models: None public yet
Datasets: None public yet