GNER Collection We introduce GNER, a Generative Named Entity Recognition framework, which demonstrates enhanced zero-shot capabilities across unseen entity domains. • 7 items • Updated Apr 30 • 9
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation Paper • 2506.09790 • Published about 1 month ago • 52
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published Jun 10 • 96
Wan2.1 14B 480p I2V LoRAs Collection A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 49 items • Updated May 24 • 185
Rendering-Aware Reinforcement Learning for Vector Graphics Generation Paper • 2505.20793 • Published May 27 • 11
Voila Collection Voila: Voice-Language Foundation Models. https://voila.maitrix.org • 7 items • Updated May 6 • 23
Wan2.1 14B T2V LoRAs Collection A collection of Remade's Wan2.1 14B T2V LoRAs • 20 items • Updated Mar 27 • 29
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation Paper • 2504.02542 • Published Apr 3 • 49
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 2 days ago • 203
xLAM-2 Collection A family of Large Action Model for multi-turn conversation and tool-use • 10 items • Updated May 5 • 19
💫StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 96
Babel Collection Open Multilingual Large Language Models Serving Over 90% of Global Speakers • 5 items • Updated Apr 15 • 18
view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others • Feb 21 • 174
PixArt-Alpha Collection This collection organize all the PixArt-Alpha related models, datasets and so on. • 9 items • Updated May 4, 2024 • 5