Hugging Face
robertchen245's Collections
Architecture improvement
Updated Apr 14
Multi-Token Attention
Paper • 2504.00927 • Published Apr 1 • 52 upvotes