DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion Paper • 2503.01183 • Published 7 days ago • 26
Article Wan 2.1 by Wan AI :best cost efficient video generation model Now Available By LLMhacker • 13 days ago • 27
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 64
FuseChat 3.0 Collection Preference Optimization for Implicit Model Fusion • 14 items • Updated 3 days ago • 13
Article FuseChat-3.0: Preference Optimization for Implicit Model Fusion By Wanfq and 2 others • Dec 18, 2024 • 5
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 18 days ago • 70
Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq and 4 others • Jan 20 • 20
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 18 days ago • 245
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 93