yuzhe gu's picture

9 13 3

yuzhe gu

vanilla1116

·

https://guyuzhe.site/

Liqu1d-G

AI & ML interests

LLM; Hallucination; Self-Improvement

Recent Activity

upvoted a paper 3 days ago

MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space

upvoted a paper 9 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

commented on a paper 23 days ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

View all activity

Organizations

vanilla1116's activity

upvoted a paper 3 days ago

MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space

Paper • 2504.13835 • Published 6 days ago • 35

upvoted a paper 9 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 10 days ago • 239

commented a paper 23 days ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published 24 days ago • 30 •

upvoted a paper 23 days ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published 24 days ago • 30

upvoted a paper about 1 month ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 47

updated a dataset about 1 month ago

opencompass/anah

Viewer • Updated Mar 13 • 783 • 132 • 3

New activity in opencompass/anah about 1 month ago

Update dataset card, link to paper, add category

#2 opened about 2 months ago by

New activity in opencompass/anah-7b about 2 months ago

Add missing metadata and clarify license

#1 opened about 2 months ago by

New activity in opencompass/anah-20b about 2 months ago

Add missing metadata: `pipeline_tag`, `library_name`, and `license`

#1 opened about 2 months ago by

New activity in opencompass/anah-v2 about 2 months ago

Improve model card with library_name and pipeline_tag

#1 opened about 2 months ago by

authored a paper about 2 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 19

upvoted a paper about 2 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 19

commented a paper about 2 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 19 •

upvoted a paper about 2 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 78

authored a paper 2 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61

upvoted a paper 2 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61

commented a paper 2 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61 •

upvoted a paper 3 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 105

updated a model 4 months ago

opencompass/anah-v2

Text Classification • Updated Mar 8 • 28 • 3

liked a Space 6 months ago

Open VLM Video Leaderboard

VLMEvalKit Eval Results in video understanding benchmark