opensearch-project/opensearch-neural-sparse-encoding-doc-v2-distill Fill-Mask • Updated 17 days ago • 1.87M • 7
view article Article Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 109
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 Text Generation • Updated Aug 7, 2024 • 268k • 62
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 128