-
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 145 -
Elucidating the Design Space of Diffusion-Based Generative Models
Paper • 2206.00364 • Published • 17 -
GLU Variants Improve Transformer
Paper • 2002.05202 • Published • 3 -
StarCoder 2 and The Stack v2: The Next Generation
Paper • 2402.19173 • Published • 147
Jeffrey Magder
jmagder
AI & ML interests
None yet
Recent Activity
updated
a collection
24 days ago
Finished Reading
upvoted
a
paper
24 days ago
Accuracy is Not All You Need
upvoted
a
paper
about 1 month ago
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning
Attention
Organizations
None yet