Qwen2.5 Collection The Qwen 2.5 models are a series of AI models trained on 18 trillion tokens, supporting 29 languages and offering advanced features such as instructio • 33 items • Updated Oct 12, 2024 • 7
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 1 day ago • 244
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published Nov 15, 2024 • 73
InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 19 days ago • 26
view article Article Yay! Organizations can now publish blog Articles By huggingface • 8 days ago • 30
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 13 days ago • 61
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 14 days ago • 40
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw • 22 days ago • 23
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 479
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated Dec 18, 2024 • 18