Xinyu Zhu's picture

Xinyu Zhu

TianHongZXY

·

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

updated a model about 23 hours ago

meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval

published a model 6 days ago

meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval

liked a dataset 16 days ago

Xnhyacinth/LongBench

View all activity

Organizations

Collections 2

Papers 13

arxiv:2603.00889

arxiv:2506.01347

arxiv:2506.15710

arxiv:2409.18786

models 12

TianHongZXY/CHIMERA-4B-SFT

4B • Updated Mar 2 • 14 • 2

TianHongZXY/CHIMERA-4B-RL

4B • Updated Mar 2 • 9 • 4

TianHongZXY/Qwen3-4B-NSR

4B • Updated Dec 6, 2025 • 3

TianHongZXY/Qwen2.5-Math-7B-GRPO

8B • Updated Jul 28, 2025 • 2

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8, 2025

TianHongZXY/Qwen2.5-Math-7B-W-REINFORCE

8B • Updated Jun 1, 2025 • 5 • 1

TianHongZXY/Qwen3-4B-GRPO

4B • Updated May 31, 2025 • 24

TianHongZXY/Qwen3-4B-PPO

4B • Updated May 31, 2025 • 3

TianHongZXY/Qwen3-4B-PSR

4B • Updated May 31, 2025 • 12 • 1

TianHongZXY/Qwen2.5-Math-7B-PPO

8B • Updated May 31, 2025 • 3

datasets 6

TianHongZXY/CHIMERA

Viewer • Updated 17 days ago • 9.23k • 599 • 21

TianHongZXY/aime-1983-2025

Viewer • Updated Apr 16, 2025 • 963 • 124

TianHongZXY/AIME2025

Viewer • Updated Mar 22, 2025 • 30 • 378 • 1

TianHongZXY/AIME2024

Viewer • Updated Mar 22, 2025 • 30 • 157

TianHongZXY/amc23

Viewer • Updated Mar 22, 2025 • 40 • 398

TianHongZXY/MATH

Viewer • Updated Jan 12, 2025 • 12.5k • 744 • 3