VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation Paper • 2503.21214 • Published Mar 27 • 2
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • Running • 2.66k
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
Multilingual RewardBench (M-RewardBench) [ACL 2025 Main] Collection Multilingual Reward Model Evaluation Dataset and Results • 3 items • Updated 21 days ago • 4
Article Scaling robotics datasets with video encoding By aliberts and 2 others • Aug 27, 2024 • 40
Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 264