Shudong Liu

Sudanl

http://sudanl.github.io

AI & ML interests

NLP, LLM

Recent Activity

upvoted a collection 23 days ago

CompassVerifier

Collection

CompassVerifier: A Unified and Robust Verifier for Large Language Models • 4 items • Updated about 3 hours ago • 3

liked 2 models 23 days ago

opencompass/CompassVerifier-7B

8B • Updated 26 days ago • 32 • 4

opencompass/CompassVerifier-32B

33B • Updated 26 days ago • 9 • 4

updated a dataset 25 days ago

opencompass/VerifierBench

Viewer • Updated 25 days ago • 2.82k • 118

updated a collection 26 days ago

CompassVerifier

Collection

CompassVerifier: A Unified and Robust Verifier for Large Language Models • 4 items • Updated about 3 hours ago • 3

upvoted a paper 27 days ago

Rethinking Verification for LLM Code Generation: From Generation to Testing

Paper • 2507.06920 • Published 28 days ago • 28

upvoted a paper 28 days ago

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published 29 days ago • 20

upvoted a paper about 2 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 110

authored 3 papers 2 months ago

TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization

Paper • 2305.01951 • Published May 3, 2023 • 1

CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries

Paper • 2501.01282 • Published Jan 2

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Paper • 2505.19815 • Published May 26 • 37

upvoted 2 papers 2 months ago

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23 • 42

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Paper • 2505.19815 • Published May 26 • 37

New activity in nvidia/Nemotron-CrossThink 2 months ago

It seems that the train_qa subset only contains multiple-choice questions

#4 opened 2 months ago by Sudanl

upvoted a paper 3 months ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published May 21 • 34

upvoted a paper 4 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 280

authored a paper 4 months ago

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 18

upvoted a paper 4 months ago

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 18

upvoted 2 papers 5 months ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 49

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 75
