bartowski/Teuken-7B-instruct-research-v0.4-GGUF Text Generation • 7B • Updated Nov 26, 2024 • 1.22k • 4
SebastianBodza/Kartoffel_Orpheus-3B_german_natural-v0.1 Text-to-Speech • 3B • Updated May 17 • 129 • 10
view post Post 2922 this paper has been blowing upthey train an open-source multimodal LLM (InternVL3) that can compete with GPT-4o and Claude 3.5 Sonnet by:> training text and vision on a single stage> a novel V2PE positional encoding> SFT & mixed preference optimizationPaper: InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models (2504.10479)> test-time scaling See translation ❤️ 6 6 👍 2 2 🔥 2 2 👀 1 1 + Reply
diarizers-community/speaker-segmentation-fine-tuned-callhome-deu 0.0B • Updated Apr 25, 2024 • 54 • 6