Hao Peng

Wesleythu
  • h-peng17

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning
liked a dataset 14 days ago
THU-KEG/VerInstruct
authored a paper 17 days ago
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Organizations

Knowledge Engineer Group @ Tsinghua University

commented a paper 17 days ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Paper • 2506.09942 • Published 18 days ago • 6 • 2
New activity in huggingface/HuggingDiscussions 18 days ago

[FEEDBACK] Daily Papers

#32 opened about 1 year ago by kramp
commented a paper 4 months ago

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26 • 22 • 2
commented a paper 8 months ago

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Paper • 2410.24175 • Published Oct 31, 2024 • 18 • 2