Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Replicate
fal
Hyperbolic
Cohere
Novita
Fireworks
Nebius AI Studio
Together AI
Cerebras
Nscale
SambaNova
HF Inference API
Misc
reward_model
Inference Endpoints
text-generation-inference
custom_code

Misc with no match

Eval Results
Merge
4-bit precision
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

34
Full-text search
Active filters: reward_model

LemiSt/PairRM-mdeberta-v3-base

Text Generation • Updated Sep 25, 2024 • 24

Huanghz/align2llava-7b-lora-question

Updated 17 days ago • 4

Huanghz/align2llava-7b-lora-answer

Updated 17 days ago • 4

il-pugin/hse-prog-task-transformer-reward-model

Reinforcement Learning • Updated 11 days ago • 54
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs