mkurman/llama-3.2-MEDIT-3B-o1-GRPO-LLM-Eval
Text Generation
Transformers
Safetensors
O1-OPEN/OpenO1-SFT
English
llama
reasoning
o1
GRPO-LLM-Evaluator
conversational
text-generation-inference
License: llama3.2
main
Commit History
Upload 8 files
e363456
verified
mkurman
committed on
Feb 22
Update README.md
823a1aa
verified
mkurman
committed on
Feb 22
Update README.md
a95dc87
verified
mkurman
committed on
Feb 22
initial commit
18eccd5
verified
mkurman
committed on
Feb 22