view article Article Unlocking Longer Generation with Key-Value Cache Quantization By RaushanTurganbay • May 16, 2024 • 49
view article Article Don't repeat yourself - 🤗 Transformers Design Philosophy By patrickvonplaten • Apr 5, 2022 • 32
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Paper • 1910.10683 • Published Oct 23, 2019 • 13