Article Introducing ColQwen-Omni: Retrieve in every modality By manu and 4 others • 19 days ago • 60
MiniMax-M1 Collection MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Jul 3 • 110
Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 81
Model Optimizer Collection A collection of generative models quantized and optimized with TensorRT Model Optimizer. • 29 items • Updated 5 days ago • 27
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training Paper • 2506.05301 • Published Jun 5 • 55
Article Training and Finetuning Embedding Models with Sentence Transformers v3 By tomaarsen • May 28, 2024 • 240
Article What is Qwen-Agent framework? Inside the Qwen family By Kseniase and 1 other • Mar 20 • 12
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 6 days ago • 254
Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • May 23 • 152
The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Paper • 2504.15521 • Published Apr 22 • 64
Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 151
Article The NLP Course is becoming the LLM Course! By burtenshaw and 9 others • Apr 3 • 99