Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing Paper • 2501.00658 • Published 12 days ago • 7
Nested Attention: Semantic-aware Attention Values for Concept Personalization Paper • 2501.01407 • Published 10 days ago • 10
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published about 1 month ago • 86
Active propulsion noise shaping for multi-rotor aircraft localization Paper • 2402.17289 • Published Feb 27, 2024