Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 43 items • Updated 9 days ago • 177
Voila Collection Voila: Voice-Language Foundation Models. https://voila.maitrix.org • 7 items • Updated May 6 • 23
view article Article Tencent InstantCharacter 1-Click Installers for Windows, RunPod and Massed Compute, Supports RTX 5000 series as well By MonsterMMORPG • Apr 19 • 3
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 19 days ago • 522
Sa2VA Model Zoo Collection Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research • 4 items • Updated Feb 9 • 37
Qwen2.5-VL (All Versions) Collection All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more! • 16 items • Updated 10 days ago • 19
Stranger Zone Collections [ Org ] Collection Artificial general intelligence • 57 items • Updated Jan 5 • 7
Wavelets Are All You Need for Autoregressive Image Generation Paper • 2406.19997 • Published Jun 28, 2024 • 32
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Paper • 2403.04692 • Published Mar 7, 2024 • 42
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation Paper • 2310.16656 • Published Oct 25, 2023 • 47