Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Rintaro Enomoto
ununtrium
Follow
AI & ML interests
None yet
Organizations
None yet
Papers
1
arxiv:
2407.03963
models
6
Sort: Recently updated
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards
Text Generation
•
Updated
Feb 19
•
3
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
Feb 11
ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k2
Text Generation
•
Updated
Feb 9
ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k
Text Generation
•
Updated
Feb 8
•
21
ununtrium/Llama-3.2-1B-Instruct-Open-R1-GRPO-gsm8k
Updated
Feb 8
ununtrium/Llama-3.2-1B-Instruct-Open-R1-GRPO-1k
Updated
Feb 4
datasets
None public yet