Kimi-VL-A3B (Collection): Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65
Multilingual RewardBench (M-RewardBench) [ACL 2025 Main] (Collection): Multilingual Reward Model Evaluation Dataset and Results • 3 items • Updated 17 days ago • 4