Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
fal
Cerebras
Nebius AI Studio
Fireworks
Novita
Together AI
Cohere
Hyperbolic
SambaNova
Replicate
HF Inference API
Misc
Reset Misc
reinforcement learning
AutoTrain Compatible
Inference Endpoints
Eval Results
text-generation-inference
Misc with no match
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
11
Full-text search
Edit filters
Sort: Trending
Active filters:
reinforcement learning
Clear all
nvidia/AceMath-RL-Nemotron-7B
Text Generation
•
Updated
about 20 hours ago
•
48
•
7
nicklashansen/tdmpc2
Reinforcement Learning
•
Updated
Oct 26, 2023
•
14
vmicheli/delta-iris
Updated
Jul 3, 2024
•
1
keras-io/deep-deterministic-policy-gradient
Updated
Jan 13, 2022
•
24
keras-io/ppo-cartpole
Updated
Jan 13, 2022
•
7
Liamdu/poca-SoccerTwos
Updated
Feb 27, 2024
•
5
Wyatt-Huang/DIPO
Updated
Mar 12, 2024
kuds/rl-lunar-lander
Updated
Aug 5, 2024
mazpie/genrl_models
Updated
Jun 25, 2024
kuds/rl-car-racing
Updated
Aug 5, 2024
siddheshtv/td3-stock-aapl
Updated
Oct 6, 2024