view article Article Decoding Strategies in Large Language Models By mlabonne โข Oct 29, 2024 โข 39
Solar Pro Collection The most intelligent LLM on a single GPU โข 4 items โข Updated Nov 15, 2024 โข 14
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. โข 4 items โข Updated Dec 3, 2024 โข 51
Yi 1.5 GGUFs Collection Collection of Yi 1.5 GGUFs made with gguf-my-repo โข 15 items โข Updated May 20, 2024 โข 5
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. โข 26 items โข Updated 20 days ago โข 549
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs Paper โข 2402.15627 โข Published Feb 23, 2024 โข 35
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper โข 2402.17764 โข Published Feb 27, 2024 โข 608
Frankenmodels Collection They're not supposed to be that size! Neat, right? โข 8 items โข Updated Dec 12, 2023 โข 3