view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 85
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
Running on CPU Upgrade Featured 2.98k The Smol Training Playbook 📚 2.98k The secrets to building world-class LLMs
deployed-models Collection Models that are currently deployed by the hf-inference provider • 1355 items • Updated 4 days ago • 33
🛩️Qwen3-VL Collection the most powerful vision-language model in the Qwen series to date. Available in Dense and MoE architectures • 5 items • Updated Oct 15, 2025
<7B Best of MoE 🧠 Collection Collection of Small size big impact MoE. • 4 items • Updated Oct 10, 2025