When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance By Nicolas-BZRD and 1 other • about 14 hours ago • 10
Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips By baidu • 6 days ago • 8
Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi By Arunbiz and 1 other • 5 days ago • 6
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 72
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • about 18 hours ago • 5
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 225
When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance By Nicolas-BZRD and 1 other • about 14 hours ago • 10
Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips By baidu • 6 days ago • 8
Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi By Arunbiz and 1 other • 5 days ago • 6
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 72
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • about 18 hours ago • 5
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 225