Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
MixEval
community
https://mixeval.github.io/
NiJinjie
Psycoy
Activity Feed
Follow
11
AI & ML interests
LLM & LMM evaluation
Recent Activity
jinjieni
authored
a paper
3 days ago
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
yuexiang96
authored
a paper
28 days ago
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations
jinjieni
authored
a paper
about 1 month ago
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition
View all activity
Team members
7
models
0
None public yet
datasets
2
Sort: Recently updated
MixEval/MixEval-X
Viewer
•
Updated
Feb 15
•
7.68k
•
136
•
10
MixEval/MixEval
Viewer
•
Updated
Sep 27, 2024
•
5k
•
173
•
22