Hugging Face
Dawei Li
wjldw
1 follower · 5 following
https://david-li0406.github.io/
home
David-Li0406
dawei-li-29b334251
AI & ML interests
LLM, NLP, Data Mining
Recent Activity
upvoted a paper about 12 hours ago
R-Zero: Self-Evolving Reasoning LLM from Zero Data
upvoted a paper about 19 hours ago
Are Today's LLMs Ready to Explain Well-Being Concepts?
upvoted a paper about 19 hours ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
Organizations
Papers (11)
arxiv:2508.01191
arxiv:2505.18759
arxiv:2502.01534
arxiv:2411.16594
Models (9)
wjldw/Qwen2.5-14B_gemini_sft_30000 • Text Generation • 15B • Updated 10 days ago • 3
wjldw/Qwen2.5-14B_gpt4_sft_30000 • Text Generation • 15B • Updated 10 days ago • 4
wjldw/bert_classifier • 0.1B • Updated Jan 22 • 3
wjldw/Mistral-7B-v0.1_gemini_dpo_30000 • Text Generation • 7B • Updated Jan 2 • 3
wjldw/Mistral-7B-v0.1_gpt4_dpo_30000 • Text Generation • 7B • Updated Jan 2 • 2
wjldw/Mistral-7B-v0.1_llama_dpo_30000 • Text Generation • 7B • Updated Jan 2 • 2
wjldw/Mistral-7B-v0.1_gemini_sft_30000 • Text Generation • 7B • Updated Dec 26, 2024 • 3
wjldw/Mistral-7B-v0.1_gpt4_sft_30000 • Text Generation • 7B • Updated Dec 26, 2024 • 3
wjldw/Mistral-7B-v0.1_llama_sft_30000 • Text Generation • 7B • Updated Dec 26, 2024 • 2
Datasets (0)
None public yet