How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published 10 days ago • 45
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024 • 14
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9 • 9
Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA Paper • 2505.16293 • Published May 22 • 2
Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA Paper • 2505.16293 • Published May 22 • 2 • 2
ServiceNow-AI/Apriel-Nemotron-15b-Thinker Text Generation • 15B • Updated May 15 • 6.37k • 89
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3 • 39
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
CohereLabsCommunity/multilingual-reward-bench Viewer • Updated Nov 4, 2024 • 66.8k • 2.59k • 29