L-Hongbin's Collections
MoE_Papers

updated Dec 25, 2024
  • A Closer Look into Mixture-of-Experts in Large Language Models

    Paper • arXiv 2406.18219 • Published Jun 26, 2024 • 16 upvotes

  • VisionZip: Longer is Better but Not Necessary in Vision Language Models

    Paper • arXiv 2412.04467 • Published Dec 5, 2024 • 117 upvotes

  • p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay

    Paper • arXiv 2412.04449 • Published Dec 5, 2024 • 7 upvotes

  • ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

    Paper • arXiv 2412.14711 • Published Dec 19, 2024 • 16 upvotes