VandeeeFeng's Collections
models
papers
apps

papers

updated Feb 21

  • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

    Paper • 2402.03300 • Published Feb 5, 2024 • 122

  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28 • 122

  • Running
    2.66k

    The Ultra-Scale Playbook

    🌌

    The ultimate guide to training LLMs on large GPU clusters


  • Running
    209

    The Ultimate Guide to LLM Training | The Ultra-Scale Playbook

    🔥

    Learn about every aspect of LLM training
