Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
GEONTT 's Collections
base
3D
LLM
audio
video
image
RAG

base

updated Jun 4, 2024
Upvote
-

  • ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition

    Paper • 2402.15220 • Published Feb 23, 2024 • 23

  • Jamba: A Hybrid Transformer-Mamba Language Model

    Paper • 2403.19887 • Published Mar 28, 2024 • 111

  • MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection

    Paper • 2403.19888 • Published Mar 29, 2024 • 12

  • Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    Paper • 2404.02258 • Published Apr 2, 2024 • 106

  • ReFT: Representation Finetuning for Language Models

    Paper • 2404.03592 • Published Apr 4, 2024 • 99

  • Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

    Paper • 2404.09967 • Published Apr 15, 2024 • 22

  • Multi-Head Mixture-of-Experts

    Paper • 2404.15045 • Published Apr 23, 2024 • 61

  • Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

    Paper • 2405.21060 • Published May 31, 2024 • 68
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs