Fostering Video Reasoning via Next-Event Prediction Paper • 2505.22457 • Published 29 days ago • 27
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18 • 18
🔱 Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated 22 days ago • 28
LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation Paper • 2410.13846 • Published Oct 17, 2024 • 2
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Apr 28 • 209