stpeteishii
stpete2
0 followers · 5 following
tztechno
AI & ML interests: None yet
Organizations: None yet
stpete2's models (18)
Sort: Recently updated
stpete2/Qwen2.5-1.5B-gsm8k-grpo • Text Generation • Updated May 15 • 6
stpete2/Qwen2.5-1.5B-gsm8k-sft • Text Generation • Updated May 10 • 3
stpete2/Qwen2.5-0.5B-gsm8k-reinforcevanilla • Text Generation • Updated May 8 • 3
stpete2/Qwen2.5-0.5B-gsm8k-reinforceplusplus • Text Generation • Updated May 8 • 3
stpete2/Qwen2.5-0.5B-gsm8k-raftvanilla • Text Generation • Updated May 8 • 3
stpete2/Qwen2.5-0.5B-gsm8k-raftplusplus • Text Generation • Updated May 8 • 3
stpete2/Qwen2.5-0.5B-gsm8k-drgrpo • Text Generation • Updated May 7 • 16
stpete2/Qwen2.5-0.5B-gsm8k-cppo • Text Generation • Updated May 7 • 3
stpete2/Qwen2.5-0.5B-gsm8k-grpo • Text Generation • Updated May 7 • 4
stpete2/Qwen2.5-0.5B-gsm8k-sft • Text Generation • Updated May 3 • 17
stpete2/Qwen2.5-0.5b-gsm8k-drgrpocppo • Text Generation • 0.6B • Updated Apr 27 • 7
stpete2/Qwen2.5-0.5b-ini • 0.5B • Updated Apr 22 • 6
stpete2/Qwen2-0.5B-math-cppo • Text Generation • 0.6B • Updated Apr 21 • 6
stpete2/Qwen2-0.5B-math-grpo • Text Generation • 0.6B • Updated Apr 21 • 5
stpete2/Qwen2-0.5B-gsm8k-grpo • Text Generation • 0.6B • Updated Apr 21 • 8
stpete2/Qwen2-0.5B-gsm8k-cppo • Text Generation • 0.6B • Updated Apr 21 • 5
stpete2/Qwen2-1.5b-zero • 2B • Updated Apr 12 • 6
stpete2/dqn_othello_20250216 • Updated Feb 17