view article Article Introducing smolagents: simple agents that write actions in code. By m-ric and 2 others • Dec 31, 2024 • 1.06k
view article Article ChatGPT-4o's Image Generation Capabilities and Its Wild Examples By prithivMLmods • Apr 5 • 20
view article Article Training Large Language Models with Interpreter Feedback using WebAssembly By axolotl-ai-co and 1 other • Apr 3 • 13
distil-large-v3.5 Collection This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. • 5 items • Updated Mar 25 • 7
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 9 days ago • 56
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 427
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 232
view article Article How to deploy and fine-tune DeepSeek models on AWS By pagezyhf and 2 others • Jan 30 • 52
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 35
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack Paper • 2309.15807 • Published Sep 27, 2023 • 32