VLM-R^3: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought Paper • 2505.16192 • Published 17 days ago • 8
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 863
Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 290
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 24 days ago • 112
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 27 days ago • 420
INTELLECT-2 Collection INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated 27 days ago • 22
Llama Nemotron Collection Open, Production-ready Enterprise Models • 8 items • Updated 1 day ago • 60
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 9 days ago • 150
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25 • 43
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published Apr 15 • 61