DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 404
view post Post 2924 Already almost 1,000 llama3 model variations have been shared publicly on HF (many more in private use at companies): https://huggingface.co/models?p=5&sort=trending&search=llama3. Everyone should fine-tune their own models for their use-cases, languages, industry, infra constraints,... 10,000 llama3 variants by the end of next week? 4 replies · 👍 15 15 ❤️ 10 10 🤯 3 3 + Reply
view article Article Welcome Llama 3 - Meta's new open LLM By philschmid and 4 others • Apr 18, 2024 • 289