Hugging Face
2140.8 TFLOPS
Quentin Gallouédec
qgallouedec
304 followers · 262 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
Recent Activity
liked a model 1 day ago: deepseek-ai/DeepSeek-V3-0324
liked a Space 2 days ago: nanotron/predict_memory
reacted to AtAndDev's post with 🤗 2 days ago: deepseek-ai/DeepSeek-R1-0528 This is the end
Organizations
Articles
6
Gotchas in Tokenizer Behavior Every Developer Should Know • 36
Open R1: Update #3 • 291
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
spaces
5
Tmp 🚀 • Sleeping
Run Hello World 👀 • Runtime error • 2
Compute 👁 • Sleeping
Run DuckDB Jobs 🦆 • Runtime error • Process datasets with DuckDB SQL
Train Memory 📈 • Running • 14 • Generate memory forecast for ML models
models
731
qgallouedec/Qwen3-0.6B-SFT • Updated 6 days ago
qgallouedec/Qwen2.5-0.5B-SFT • Updated 7 days ago
qgallouedec/SmolLM2-360M-Rickified-GRPO • Text Generation • Updated 10 days ago • 54 • 1
qgallouedec/SmolLM2-360M-Rickified • Text Generation • Updated 11 days ago • 550
qgallouedec/SmolLM2-360M-SFT • Text Generation • Updated 23 days ago • 4
qgallouedec/R1-Zero-Qwen-7B-Math • Text Generation • Updated about 1 month ago • 171
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • Updated Apr 8 • 10
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated Apr 7 • 17
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Mar 26
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Mar 24
datasets
72
qgallouedec/trl-metrics • Viewer • Updated 4 days ago • 120k • 321 • 1
qgallouedec/rick-physics-grpo • Viewer • Updated 10 days ago • 1.79k • 185 • 1
qgallouedec/rick-science • Viewer • Updated 15 days ago • 1.18k • 185 • 1
qgallouedec/physics-problems • Viewer • Updated 22 days ago • 247 • 49
qgallouedec/rick-teaches-math • Viewer • Updated 22 days ago • 6.8k • 98
qgallouedec/DAPO-Math-17k-Processed-Scored • Viewer • Updated Apr 29 • 16.4k • 98 • 2
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 45 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 31
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 29
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 36