view article Article (LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • 24 days ago • 75
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • about 1 month ago • 113
view article Article quanto: a pytorch quantization toolkit By dacorvo and 2 others • Mar 18, 2024 • 41
view article Article Exploring Quantization Backends in Diffusers By derekl35 and 2 others • May 21 • 38
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs By davidberenstein1957 and 1 other • May 7 • 38
view article Article Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM By INSAIT-Institute and 2 others • Apr 23 • 55
view article Article ChatGPT-4o's Image Generation Capabilities and Its Wild Examples By prithivMLmods • Apr 5 • 20
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 By reach-vb and 6 others • Feb 18 • 99
view article Article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies By prithivMLmods • Feb 17 • 22
view article Article Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501 By ruslanmv • Feb 8 • 3
view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 195
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.27k
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • Jan 31 • 50
view article Article 🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows By Kseniase • Feb 2 • 17
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 405