Dawei Li

wjldw

https://david-li0406.github.io/

AI & ML interests

LLM, NLP, Data Mining

Recent Activity

upvoted a paper about 17 hours ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

upvoted a paper about 24 hours ago

Are Today's LLMs Ready to Explain Well-Being Concepts?

upvoted a paper 1 day ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

authored a paper 1 day ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published 7 days ago • 173

authored 7 papers 2 months ago

C3KG: A Chinese Commonsense Conversation Knowledge Graph

Paper • 2204.02549 • Published Apr 6, 2022

Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model

Paper • 2404.10306 • Published Apr 16, 2024 • 1

Optimizing Language Model's Reasoning Abilities with Weak Supervision

Paper • 2405.04086 • Published May 7, 2024 • 2

Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning

Paper • 2403.20046 • Published Mar 29, 2024

Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking

Paper • 2310.12342 • Published Oct 18, 2023

SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents

Paper • 2411.03284 • Published Nov 5, 2024

The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation

Paper • 2505.18759 • Published May 24 • 12

authored a paper 6 months ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published Feb 3 • 42

authored 2 papers 9 months ago

Contextualization Distillation from Large Language Model for Knowledge Graph Completion

Paper • 2402.01729 • Published Jan 28, 2024

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 42