23 47 93

Li Dong

unilm

AI & ML interests

Language Model Pre-Training

Recent Activity

liked a model about 7 hours ago

Motif-Technologies/Motif-2.6B

liked a model about 7 hours ago

microsoft/VibeVoice-1.5B

liked a model about 7 hours ago

microsoft/VibeVoice-Tokenizer

View all activity

Organizations

upvoted a collection about 7 hours ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 3 items • Updated about 5 hours ago • 12

upvoted a paper 4 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 6 days ago • 29

upvoted a paper 6 days ago

DINOv3

Paper • 2508.10104 • Published 12 days ago • 194

upvoted a paper 14 days ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published 20 days ago • 61

upvoted 2 papers about 1 month ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 61

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 34

upvoted 3 papers about 2 months ago

Differential Mamba

Paper • 2507.06204 • Published Jul 8 • 19

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 232

MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings

Paper • 2506.23115 • Published Jun 29 • 37

upvoted 2 papers 2 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 40

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Paper • 2506.08889 • Published Jun 10 • 24

upvoted 6 papers 3 months ago

upvoted a paper 7 months ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 59

upvoted 2 papers 8 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 283

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 99

Li Dong

AI & ML interests

Recent Activity

Organizations

unilm's activity