Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
choco9966 's Collections
Reading-Paper-List
Post-Training Papers

Reading-Paper-List

updated Apr 22
Upvote
-

  • BitNet b1.58 2B4T Technical Report

    Paper • 2504.12285 • Published Apr 16 • 73

  • DataDecide: How to Predict Best Pretraining Data with Small Experiments

    Paper • 2504.11393 • Published Apr 15 • 18

  • Efficient Process Reward Model Training via Active Learning

    Paper • 2504.10559 • Published Apr 14 • 13

  • CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

    Paper • 2504.13161 • Published Apr 17 • 92
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs