24 60 29

Yuansheng Ni

yuanshengni

https://yuanshengni.github.io/

AI & ML interests

NLP

Recent Activity

updated a dataset 4 days ago

MMMU/MMMU_Pro

upvoted a paper 23 days ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

updated a dataset about 1 month ago

MMMU/MMMU

View all activity

Organizations

updated a dataset 4 days ago

MMMU/MMMU_Pro

Benchmark • Updated 4 days ago • 5.19k • 19.3k • 58

upvoted a paper 23 days ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 120

updated a dataset about 1 month ago

MMMU/MMMU

Viewer • Updated Apr 21 • 11.6k • 90k • 327

upvoted 3 papers about 2 months ago

upvoted a paper 2 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

upvoted a collection 2 months ago

OpenResearcher

Collection

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 8 items • Updated Mar 24 • 18

upvoted a paper 2 months ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 98

upvoted 2 papers 3 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem

Paper • 2602.14367 • Published Feb 16 • 17

upvoted a paper 4 months ago

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36

New activity in MMMU/MMMU 5 months ago

wrong_use，need deleted

#6 opened 5 months ago by

Aros199

upvoted a paper 5 months ago

Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency

Paper • 2601.05905 • Published Jan 9 • 21

upvoted 2 papers 6 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 78

InnoGym: Benchmarking the Innovation Potential of AI Agents

Paper • 2512.01822 • Published Dec 1, 2025 • 36

updated 3 models 7 months ago

TIGER-Lab/VisCoder2-32B

Image-Text-to-Text • 33B • Updated Nov 3, 2025 • 8 • 1

TIGER-Lab/VisCoder2-14B

Image-Text-to-Text • 15B • Updated Nov 3, 2025 • 10 • • 2

TIGER-Lab/VisCoder2-3B

Text Generation • 3B • Updated Nov 3, 2025 • 902 • • 3

updated a dataset 7 months ago

TIGER-Lab/VisPlotBench

Viewer • Updated Nov 3, 2025 • 888 • 505 • 2

Yuansheng Ni

AI & ML interests

Recent Activity

Organizations

yuanshengni's activity

wrong_use，need deleted