Hugging Face
Models: 9
Sort: Trending
Active filters: OpenRLHF/Llama-3-8b-sft-mixture
RTO-RL/Llama3-8B-RTO • Updated Feb 11 • 6 • 1
RTO-RL/Llama3-8B-PPO • Updated Feb 11 • 3 • 1
RTO-RL/Llama3-8B-RDPO • Updated Feb 11 • 3 • 1
RTO-RL/Llama3-8B-TDPO • Updated Feb 11 • 3 • 1
RTO-RL/Llama3-8B-RPP • Updated 7 days ago • 5 • 1
RTO-RL/Llama3-8B-RTO_RPP • Updated 7 days ago • 3 • 1
RTO-RL/Llama3-8B-RewardModel • Updated Feb 11 • 88
RTO-RL/Llama3-8B-DPO • Updated Feb 11 • 45
RTO-RL/Llama3-8B-SimPO • Updated Feb 11 • 9