Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation Paper โข 2507.06607 โข Published Jul 9 โข 10 โข 1
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Paper โข 2406.07522 โข Published Jun 11, 2024 โข 41 โข 5