2 94 29

Maozhou Ge

Gmc2

GHGmc2

AI & ML interests

None yet

Recent Activity

upvoted an article 9 days ago

From GRPO to DAPO and GSPO: What, Why, and How

upvoted a paper 12 days ago

Group Sequence Policy Optimization

upvoted a collection 12 days ago

Qwen3

View all activity

Organizations

None yet

upvoted an article 9 days ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

•

10 days ago

• 11

upvoted a paper 12 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 26 days ago • 289

upvoted a collection 12 days ago

Qwen3

Collection

84 items • Updated 13 days ago • 1.11k

upvoted a paper about 1 month ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7 • 39

upvoted an article about 1 month ago

Article

Mixture of Depth is Vibe

•

Apr 22, 2024

• 48

upvoted an article 2 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 46

upvoted a paper 3 months ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 68

upvoted an article 4 months ago

Article

Vision Language Models Explained

and 1 other •

Apr 11, 2024

• 437

upvoted a collection 5 months ago

🌾Oat-Zero: Understanding R1-Zero-Like Training

Collection

5 items • Updated Apr 10 • 7

upvoted a paper 5 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 200

upvoted 2 articles 5 months ago

Article

How 🤗 Accelerate runs very large models thanks to PyTorch

•

Sep 27, 2022

• 14

Article

Open R1: Update #3

and 9 others •

Mar 11

• 295

upvoted 2 articles 6 months ago

Article

Open R1: Update #2

and 6 others •

Feb 10

• 217

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.28k

upvoted an article 7 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 877

upvoted a paper 7 months ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 69

upvoted a paper 9 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 62

upvoted 3 papers 10 months ago

Maozhou Ge

AI & ML interests

Recent Activity

Organizations

Gmc2's activity

From GRPO to DAPO and GSPO: What, Why, and How

Mixture of Depth is Vibe

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Vision Language Models Explained

How 🤗 Accelerate runs very large models thanks to PyTorch

Open R1: Update #3

Open R1: Update #2

Open-source DeepResearch – Freeing our search agents

Open-R1: a fully open reproduction of DeepSeek-R1