Feng
VandeeeFeng
AI & ML interests
None yet
Recent Activity
new activity
6 days ago
deepseek-ai/DeepSeek-R1-0528: DeepSeek lunar calendar
updated
a collection
18 days ago
apps
updated
a collection
26 days ago
apps
Organizations
None yet
Collections
3
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 121
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
  Paper • 2501.17161 • Published • 122
- 2.65k
  The Ultra-Scale Playbook
  🌌The ultimate guide to training LLM on large GPU Clusters
- 209
  The Ultimate Guide to LLM Training | The Ultra-Scale Playbook
  🔥Learn every aspect of LLM training
datasets
0
None public yet