Penghui Qi's picture

3 19 3

Penghui Qi

QPHutu

·

QPHutu

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 minutes ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

upvoted a paper 3 days ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

updated a collection 8 days ago

View all activity

Organizations

authored a paper about 1 month ago

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19 • 35

authored a paper 3 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 52

authored a paper 4 months ago

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization

Paper • 2503.01328 • Published Mar 3 • 16

authored a paper 8 months ago

Balancing Pipeline Parallelism with Vocabulary Parallelism

Paper • 2411.05288 • Published Nov 8, 2024 • 20

authored a paper about 1 year ago

Pipeline Parallelism with Controllable Memory

Paper • 2405.15362 • Published May 24, 2024 • 3

authored a paper over 1 year ago

Zero Bubble Pipeline Parallelism

Paper • 2401.10241 • Published Nov 30, 2023 • 25