view post Post 2927 Already almost 1,000 llama3 model variations have been shared publicly on HF (many more in private use at companies): https://huggingface.co/models?p=5&sort=trending&search=llama3. Everyone should fine-tune their own models for their use-cases, languages, industry, infra constraints,... 10,000 llama3 variants by the end of next week? 4 replies · 👍 15 15 ❤️ 10 10 🤯 3 3 + Reply
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 624
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 15 days ago • 209