ReasonIR: Training Retrievers for Reasoning Tasks Paper • 2504.20595 • Published 16 days ago • 52
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10 • 98
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling Paper • 2410.09223 • Published Oct 11, 2024 • 5