Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mengfanxu's picture
2 3 128

mengfanxu

fxmeng
21world's profile picture Jinnan's profile picture kevinapple's profile picture
·
https://fxmeng.github.io
  • fxmeng

AI & ML interests

None yet

Recent Activity

updated a dataset 27 days ago
fxmeng/transmla_pretrain_100m_tokens
updated a dataset 27 days ago
fxmeng/transmla_pretrain_1B_tokens
updated a dataset 27 days ago
fxmeng/transmla_pretrain_6B_tokens
View all activity

Organizations

None yet

commented 3 papers 6 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57 •
9

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57 •
9

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57 •
9
New activity in MMMU/MMMU over 1 year ago

Question about "Text as Input"

#4 opened over 1 year ago by
fxmeng
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs