attention zoo - a wwymak Collection

wwymak 's Collections

multilingual modelling

small-but-mighty-llms

llm-long-context

llm-explainability

image-generation-models

attention zoo

updated Apr 24

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14 • 43
Ring Attention with Blockwise Transformers for Near-Infinite Context

Paper • 2310.01889 • Published Oct 3, 2023 • 10
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12 • 63