Hao Peng's picture

Hao Peng

Wesleythu

·

h-peng17

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

liked a dataset 14 days ago

THU-KEG/VerInstruct

authored a paper 17 days ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

View all activity

Organizations

authored a paper 17 days ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Paper • 2506.09942 • Published 18 days ago • 6

authored 4 papers 4 months ago

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

Paper • 2306.09296 • Published Jun 15, 2023 • 19

ADELIE: Aligning Large Language Models on Information Extraction

Paper • 2405.05008 • Published May 8, 2024 • 2

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Paper • 2410.24175 • Published Oct 31, 2024 • 18

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26 • 22

authored a paper 8 months ago

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published Oct 21, 2024 • 16