Collections
Discover the best community collections!
Collections trending this week
-
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 51 -
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Paper • 2403.03950 • Published • 16 -
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Paper • 2403.02709 • Published • 9
-
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 51 -
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Paper • 2403.03950 • Published • 16 -
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Paper • 2403.02709 • Published • 9