Aurelien Lucchi
alucchi
AI & ML interests
None yet
Recent Activity
updated a dataset about 15 hours ago: alucchi/Qwen3-4B_pre_g1110_b18_1_a10_2
published a dataset about 15 hours ago: alucchi/Qwen3-4B_pre_g1110_b18_1_a10_2
updated a dataset about 17 hours ago: alucchi/Qwen3-4B_n1000_e2_oadam0.0001_b16_1_a10_g1110_1825_n1000_e1_oadam0.0001_b18_1_a10_1825
Organizations
None yet
alucchi's activity
[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
#15 opened 4 months ago by lewtun
