Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alessamo 's Collections
entropy
data
RL
DPO

data

updated about 1 month ago
Upvote
-

  • CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

    Paper • 2504.13161 • Published Apr 17 • 92

  • Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

    Paper • 2505.19949 • Published May 26 • 16
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs