The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
liked
a dataset
1 day ago
nvidia/Nemotron-Post-Training-Dataset-v1
upvoted
a
collection
11 days ago
RLVR-Decomposed
updated
a model
11 days ago
TianHongZXY/Qwen2.5-Math-7B-GRPO