Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Feng's picture
4 8

Feng

Yunzhen
y8phi's profile picture
·
https://fengyzpku.github.io/
  • fengyzpku

AI & ML interests

None yet

Organizations

None yet

commented 3 papers 3 months ago

Don't Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting

Paper • 2510.08696 • Published Oct 9 • 14 •
3

Rethinking Thinking Tokens: LLMs as Improvement Operators

Paper • 2510.01123 • Published Oct 1 • 5 •
2

What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

Paper • 2509.19284 • Published Sep 23 • 22 •
2
commented a paper 11 months ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published Feb 6 • 12 •
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs