Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Paper β’ 2503.21696 β’ Published 28 days ago β’ 22
MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation Paper β’ 2503.14428 β’ Published Mar 18 β’ 8
SSTuning Collection Zero-shot text classification models trained with self-supervised tuning (SSTuning) β’ 5 items β’ Updated Mar 11
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper β’ 2503.00865 β’ Published Mar 2 β’ 64
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper β’ 2503.00865 β’ Published Mar 2 β’ 64
InfoCTM: A Mutual Information Maximization Perspective of Cross-Lingual Topic Modeling Paper β’ 2304.03544 β’ Published Apr 7, 2023