Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
stpeteishii's picture
2

stpeteishii

stpete2
·
  • tztechno

AI & ML interests

None yet

Organizations

None yet

stpete2 's models 18

stpete2/Qwen2.5-1.5B-gsm8k-grpo

Text Generation • Updated May 15 • 6

stpete2/Qwen2.5-1.5B-gsm8k-sft

Text Generation • Updated May 10 • 3

stpete2/Qwen2.5-0.5B-gsm8k-reinforcevanilla

Text Generation • Updated May 8 • 3

stpete2/Qwen2.5-0.5B-gsm8k-reinforceplusplus

Text Generation • Updated May 8 • 3

stpete2/Qwen2.5-0.5B-gsm8k-raftvanilla

Text Generation • Updated May 8 • 3

stpete2/Qwen2.5-0.5B-gsm8k-raftplusplus

Text Generation • Updated May 8 • 3

stpete2/Qwen2.5-0.5B-gsm8k-drgrpo

Text Generation • Updated May 7 • 16

stpete2/Qwen2.5-0.5B-gsm8k-cppo

Text Generation • Updated May 7 • 3

stpete2/Qwen2.5-0.5B-gsm8k-grpo

Text Generation • Updated May 7 • 4

stpete2/Qwen2.5-0.5B-gsm8k-sft

Text Generation • Updated May 3 • 17

stpete2/Qwen2.5-0.5b-gsm8k-drgrpocppo

Text Generation • 0.6B • Updated Apr 27 • 7

stpete2/Qwen2.5-0.5b-ini

0.5B • Updated Apr 22 • 6

stpete2/Qwen2-0.5B-math-cppo

Text Generation • 0.6B • Updated Apr 21 • 6

stpete2/Qwen2-0.5B-math-grpo

Text Generation • 0.6B • Updated Apr 21 • 5

stpete2/Qwen2-0.5B-gsm8k-grpo

Text Generation • 0.6B • Updated Apr 21 • 8

stpete2/Qwen2-0.5B-gsm8k-cppo

Text Generation • 0.6B • Updated Apr 21 • 5

stpete2/Qwen2-1.5b-zero

2B • Updated Apr 12 • 6

stpete2/dqn_othello_20250216

Updated Feb 17
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs