view article Article LLM Dataset Formats 101: A No‐BS Guide for Hugging Face Devs By tegridydev • Feb 1 • 5
view article Article From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning By NormalUhr • Feb 4 • 11
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • 27 days ago • 9
OLMoE (January 2025) Collection Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 27 days ago • 9
view article Article 🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows By Kseniase • Feb 2 • 15
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 341
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • Jan 20 • 37
llama.vim Collection Recommended models for the llama.vim and llama.vscode plugins • 6 items • Updated about 11 hours ago • 24
view article Article A Beginner-Friendly PyTorch Tutorial: Build and Train Your First Model By dvgodoy • Jan 20 • 5