Models

9

Active filters: OpenRLHF/Llama-3-8b-sft-mixture

RTO-RL/Llama3-8B-RTO

Updated Feb 11 • 6 • 1

RTO-RL/Llama3-8B-PPO

Updated Feb 11 • 3 • 1

RTO-RL/Llama3-8B-RDPO

Updated Feb 11 • 3 • 1

RTO-RL/Llama3-8B-TDPO

Updated Feb 11 • 3 • 1

RTO-RL/Llama3-8B-RPP

Updated 7 days ago • 5 • 1

RTO-RL/Llama3-8B-RTO_RPP

Updated 7 days ago • 3 • 1

RTO-RL/Llama3-8B-RewardModel

Updated Feb 11 • 88

RTO-RL/Llama3-8B-DPO

Updated Feb 11 • 45

RTO-RL/Llama3-8B-SimPO

Updated Feb 11 • 9