Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
bgibson 's Collections
llm-analysis
llm-local
papers
llm-datasets
llm-models

papers

updated Feb 1, 2024
Upvote
-

  • Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

    Paper • 2401.09417 • Published Jan 17, 2024 • 63

  • MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

    Paper • 2401.04081 • Published Jan 8, 2024 • 73

  • SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

    Paper • 2312.07987 • Published Dec 13, 2023 • 41

  • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

    Paper • 2401.06066 • Published Jan 11, 2024 • 55
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs