Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated about 18 hours ago • 66
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 5 days ago • 208
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published 7 days ago • 76
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Paper • 2501.12273 • Published 6 days ago • 14
view article Article Yay! Organizations can now publish blog Articles By huggingface • 7 days ago • 30
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 27 items • Updated about 8 hours ago • 65
Jan 17 Releases ❄️ Collection Models and datasets of the second week of Jan 2025. • 23 items • Updated 10 days ago • 10
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 11 days ago • 65
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 11 days ago • 46
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents Paper • 2501.08828 • Published 12 days ago • 28
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper • 2501.08994 • Published 12 days ago • 15
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 13 days ago • 268
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 14 days ago • 88
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper • 2501.05707 • Published 17 days ago • 19
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 17 days ago • 59
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published 17 days ago • 66