Mingzhe Du PRO

Elfsong

https://mingzhe.space

AI & ML interests

Code Generation / Preference Alignment / Bias Mitigation

Recent Activity

upvoted a paper 21 days ago

BENCHAGENTS: Automated Benchmark Creation with Agent Interaction

Paper • 2410.22584 • Published Oct 29, 2024 • 1

liked a dataset 26 days ago

Elfsong/Mercury_Multilingual

Viewer • Updated Jun 24, 2024 • 13.8M • 4.06k • 1

upvoted a paper about 1 month ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 86

authored a paper about 1 month ago

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Paper • 2505.23387 • Published May 29 • 7

upvoted a paper about 1 month ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 62

updated a Space about 1 month ago

Monolith

🚀

Sandbox for Code Generation

upvoted 3 papers about 1 month ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 166

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published May 29 • 56

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Paper • 2505.23387 • Published May 29 • 7

commented on a paper about 1 month ago

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Paper • 2505.23387 • Published May 29 • 7

upvoted a paper about 1 month ago

Effi-Code: Unleashing Code Efficiency in Language Models

Paper • 2410.10209 • Published Oct 14, 2024 • 2

updated a dataset about 1 month ago

Elfsong/Venus

Viewer • Updated May 30 • 9.3k • 169 • 5

upvoted a paper about 1 month ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 102

liked a dataset about 1 month ago

EffiBench/effibench-x

Viewer • Updated May 16 • 623 • 43 • 1

authored a paper about 2 months ago

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Paper • 2505.11049 • Published May 16 • 59

upvoted a paper about 2 months ago

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Paper • 2505.11049 • Published May 16 • 59

updated a dataset about 2 months ago

Elfsong/Venus_Anotation

Viewer • Updated May 18 • 9 • 18

liked a dataset about 2 months ago

acl-anonymous/CrowdEval

Viewer • Updated Feb 15 • 388k • 854 • 1

upvoted a paper about 2 months ago

WaterDrum: Watermarking for Data-centric Unlearning Metric

Paper • 2505.05064 • Published May 8 • 8
