Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alessamo's picture
1 3

Alessamo

Alessamo

AI & ML interests

None yet

Recent Activity

updated a collection about 1 month ago
DPO
updated a collection about 1 month ago
RL
updated a collection about 1 month ago
RL
View all activity

Organizations

None yet

Collections 3

data
  • CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

    Paper • 2504.13161 • Published Apr 17 • 92
RL
  • Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

    Paper • 2504.13837 • Published Apr 18 • 126
  • TTRL: Test-Time Reinforcement Learning

    Paper • 2504.16084 • Published Apr 22 • 109
  • What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

    Paper • 2503.24235 • Published Mar 31 • 54

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs