Fostering Video Reasoning via Next-Event Prediction Paper • 2505.22457 • Published 29 days ago • 27
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18 • 18
🔱 Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated 22 days ago • 28
LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation Paper • 2410.13846 • Published Oct 17, 2024 • 2
GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding Paper • 2402.02082 • Published Feb 3, 2024 • 1
view article Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick By cxdu • Oct 24, 2024 • 12
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Apr 28 • 209