Dawei Li

wjldw

https://david-li0406.github.io/

AI & ML interests

LLM, NLP, Data Mining

Papers 15

arxiv:2601.12294

arxiv:2509.25154

arxiv:2508.19570

arxiv:2508.01191

Models 18

wjldw/ToolPRM-GRPO-synthesis

4B • Updated Jan 4 • 2

wjldw/ToolPRM-GRPO-v4

4B • Updated Jan 3 • 4

wjldw/ToolPRM-Base-v4

Text Generation • 196k • Updated Jan 3 • 4

wjldw/ToolPRM-CoT-v4

Text Generation • 196k • Updated Jan 3 • 3

wjldw/ToolPRM-Base-synthesis

Text Generation • 196k • Updated Jan 3 • 3

wjldw/ToolPRM-GRPO-v3

4B • Updated Jan 1 • 3

wjldw/ToolPRM-Checklist-v3

Text Generation • 196k • Updated Jan 1 • 4

wjldw/ToolPRM-Base-v3

Text Generation • 196k • Updated Jan 1 • 3

wjldw/ToolPRM-CoT-v3

Text Generation • 196k • Updated Jan 1 • 2

wjldw/Qwen2.5-14B_gemini_sft_30000

Text Generation • 15B • Updated Jul 29, 2025 • 3

Datasets 1

wjldw/JD-Bench

Viewer • Updated Sep 29, 2025 • 42k • 36