Laguna M.1 Collection Our most capable model to date, designed for long-horizon work. Apache 2.0. • 4 items • Updated 1 day ago • 18
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens Paper • 2603.23516 • Published Mar 6 • 53
view article Article Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP +3 ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 19 days ago • 50
view article Article How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent nvidia • 26 days ago • 67
PP-OCRv6 Collection From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks • 19 items • Updated 15 days ago • 98
view article Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 21 days ago • 79
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 22 days ago • 65
Zamba2-VL Collection A suite of vision-language models based on Zamba2. • 3 items • Updated 21 days ago • 5
Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models Paper • 2606.11167 • Published 21 days ago • 5
Interactivity Alignment Collection Full-duplex speech models post-trained with reinforcement learning for improved conversational interactivity. • 4 items • Updated 20 days ago • 6
Self-Evolving Vision-Language Models for Image Quality Assessment via Voting and Ranking Paper • 2509.25787 • Published Jan 27 • 3