Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
128
mengfanxu
fxmeng
Follow
21world's profile picture
Jinnan's profile picture
kevinapple's profile picture
17 followers
·
31 following
https://fxmeng.github.io
fxmeng
AI & ML interests
None yet
Recent Activity
updated
a dataset
27 days ago
fxmeng/transmla_pretrain_100m_tokens
updated
a dataset
27 days ago
fxmeng/transmla_pretrain_1B_tokens
updated
a dataset
27 days ago
fxmeng/transmla_pretrain_6B_tokens
View all activity
Organizations
None yet
fxmeng
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
commented
3 papers
6 months ago
TransMLA: Multi-head Latent Attention Is All You Need
Paper
•
2502.07864
•
Published
Feb 11
•
57
•
9
TransMLA: Multi-head Latent Attention Is All You Need
Paper
•
2502.07864
•
Published
Feb 11
•
57
•
9
TransMLA: Multi-head Latent Attention Is All You Need
Paper
•
2502.07864
•
Published
Feb 11
•
57
•
9
New activity in
MMMU/MMMU
over 1 year ago
Question about "Text as Input"
#4 opened over 1 year ago by
fxmeng
Load more