EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. β’ 10 items β’ Updated 16 days ago β’ 81
GPT-Generated Unified Format (GGUF) Collection ease of reading β’ 34 items β’ Updated 2 days ago β’ 10
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. β’ 8 items β’ Updated Nov 23 β’ 78
Llama3-8B-1.58 Collection A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! β’ 3 items β’ Updated Sep 14 β’ 12
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 15 items β’ Updated 3 days ago β’ 195
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper β’ 2402.14905 β’ Published Feb 22 β’ 126
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 β’ 9 items β’ Updated 29 days ago β’ 99
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper β’ 2410.02884 β’ Published Oct 3 β’ 52
D_AU - Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc Collection Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. β’ 75 items β’ Updated 1 day ago β’ 6
GGUF Image Model Quants Collection List of GGUF quants for text to image base models. β’ 12 items β’ Updated 5 days ago β’ 17
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper β’ 2404.05719 β’ Published Apr 8 β’ 82
view article Article MedEmbed: Fine-Tuned Embedding Models for Medical / ClinicalΒ IR By abhinand β’ Oct 20 β’ 33
view article Article Advanced Flux Dreambooth LoRA Training with 𧨠diffusers By linoyts ⒠Oct 21 ⒠32
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 8 items β’ Updated 8 days ago β’ 96