Collections
Discover the best community collections!
Collections trending this week
-
tiiuae/falcon-180B
Text Generation • Updated • 5.85k • 1.14k -
tiiuae/falcon-180B-chat
Text Generation • Updated • 63 • 545 -
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models
Paper • 2309.14509 • Published • 18 -
Effective Long-Context Scaling of Foundation Models
Paper • 2309.16039 • Published • 30