Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
agcfg
non-profit
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Recent Activity
yilunzhao
authored
a paper
about 11 hours ago
Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure
yilunzhao
authored
a paper
about 11 hours ago
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation
yilunzhao
authored
a paper
17 days ago
VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos
View all activity
Team members
3
models
0
None public yet
datasets
0
None public yet