Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason! By Writer and 1 other • 6 days ago • 53
mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL By driaforall and 1 other • 6 days ago • 12
"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack By anemll • about 22 hours ago • 9
AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models By imomayiz and 4 others • 1 day ago • 8
Fine-tune Any LLM from the Hugging Face Hub with Together AI By togethercomputer and 3 others • 7 days ago • 7
🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎 By sasha and 1 other • about 4 hours ago • 5
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 216
How to Train an Antibody Developability Model By ginkgo-datapoints and 1 other • about 4 hours ago • 4
Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚 By Isayoften • Aug 26, 2024 • 74