Lora & full finetune experiments on r1 distills to generate python code for math problems
Ram PRO
0-hero
AI & ML interests
All work on this profile is personal
Recent Activity
new activity
18 days ago
fffiloni/bnb-iso-skeuo-3d-icns-gen:Might need to change fal model endpoint
published
a model
24 days ago
0-hero/r1-7b-grpo-full
published
a model
24 days ago
0-hero/R1-7B-MATH-GRPO-FULL
Organizations
Collections
5
models
49

0-hero/r1-7B-grpo-v3.3-epoch-3
Updated
•
4

0-hero/r1-7B-grpo-v3.3-epoch-2
Updated
•
2

0-hero/r1-7B-grpo-v3.3-epoch-1
Updated
•
2

0-hero/r1-7B-grpo-v3.2-epoch-2
Updated
•
4

0-hero/r1-7B-grpo-v3.2-epoch-1
Updated
•
2

0-hero/r1-14B-grpo-v3.1-epoch-2
Updated
•
2

0-hero/r1-14B-grpo-v3.1-epoch-1
Updated
•
4

0-hero/r1-7B-grpo-v3.1-epoch-3
Updated
•
1

0-hero/r1-7B-grpo-v3.1-epoch-2
Updated
•
4

0-hero/r1-7B-grpo-v2-temp-1.0-60
Updated
•
3
datasets
14
0-hero/MATH
Viewer
•
Updated
•
331k
•
50
0-hero/audio-samples-fixed
Viewer
•
Updated
•
10
•
13
0-hero/distilabel-math-preference-dpo
Viewer
•
Updated
•
2.42k
•
22
0-hero/lj_speech_with_spectogram_conversations
Viewer
•
Updated
•
13.1k
•
19
•
1
0-hero/lj_speech_with_spectogram
Viewer
•
Updated
•
13.1k
•
28
•
1
0-hero/Matter-0.2-alpha
Viewer
•
Updated
•
2.52M
•
44
•
3
0-hero/Matter-0.1
Viewer
•
Updated
•
2.25M
•
80
•
53
0-hero/Matter-0.1-Slim-D
Viewer
•
Updated
•
1.32M
•
49
0-hero/Matter-0.1-Slim-C
Viewer
•
Updated
•
343k
•
37
0-hero/Matter-0.1-Slim-B
Viewer
•
Updated
•
308k
•
25
•
1