nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • Updated about 1 month ago • 10.6k • 308
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale Paper • 2408.12570 • Published Aug 22, 2024 • 34
Jamba: A Hybrid Transformer-Mamba Language Model Paper • 2403.19887 • Published Mar 28, 2024 • 111