On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published 1 day ago • 77
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 7 days ago • 173
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model Paper • 2404.10306 • Published Apr 16, 2024 • 1
Optimizing Language Model's Reasoning Abilities with Weak Supervision Paper • 2405.04086 • Published May 7, 2024 • 2
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning Paper • 2403.20046 • Published Mar 29, 2024
Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking Paper • 2310.12342 • Published Oct 18, 2023
SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents Paper • 2411.03284 • Published Nov 5, 2024
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation Paper • 2505.18759 • Published May 24 • 12
PhyX: Does Your Model Have the "Wits" for Physical Reasoning? Paper • 2505.15929 • Published May 21 • 49