Quentin Gallouédec's picture

Quentin Gallouédec PRO

qgallouedec

·

AI & ML interests

None yet

Recent Activity

upvoted an article about 3 hours ago

🐯 Liger GRPO meets TRL

updated a dataset about 16 hours ago

trl-lib/documentation-images

liked a model 3 days ago

deepseek-ai/DeepSeek-V3-0324

View all activity

Organizations

qgallouedec's activity

upvoted an article about 3 hours ago

Article

🐯 Liger GRPO meets TRL

By

and 5 others •

9 days ago

• 33

updated a dataset about 16 hours ago

trl-lib/documentation-images

Viewer • Updated about 16 hours ago • 7 • 113k

liked a model 3 days ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated Mar 27 • 370k • • 2.95k

liked a Space 4 days ago

Predict Memory

Calculate memory usage from model configurations

reacted to AtAndDev's post with 🤗 4 days ago

Post

2629

deepseek-ai/DeepSeek-R1-0528

This is the end

1 reply

·

liked a model 4 days ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • Updated 4 days ago • 41.6k • • 1.62k

upvoted a paper 4 days ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published Apr 15 • 5

updated a dataset 6 days ago

qgallouedec/trl-metrics

Viewer • Updated 6 days ago • 120k • 321 • 1

liked a dataset 7 days ago

open-r1/Mixture-of-Thoughts

Viewer • Updated 7 days ago • 699k • 17.8k • 166

upvoted a paper 7 days ago

INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning

Paper • 2505.07291 • Published 22 days ago • 11

published a model 8 days ago

qgallouedec/Qwen3-0.6B-SFT

Updated 8 days ago

updated a Space 8 days ago

Train

Show job instructions for TRL model training

published a model 9 days ago

qgallouedec/Qwen2.5-0.5B-SFT

Updated 9 days ago

liked a Space 9 days ago

Train

Show job instructions for TRL model training

updated a Space 9 days ago

Train

Show job instructions for TRL model training

published a Space 9 days ago

Tmp

updated 2 Spaces 9 days ago

Train

Show job instructions for TRL model training

Run Hello World