FramePack Video Gen Collection Fast & compact video generation with FramePack, a next-frame prediction neural network structure that generates videos progressively • 7 items • Updated 6 days ago • 3
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 3 days ago • 31
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data Paper • 2502.14397 • Published Feb 20 • 42
VisCoT Collection Visual CoT: Unleashing Chain-of-Thought Reasoning in the Multi-Modal Language Model • 5 items • Updated Jun 13, 2024 • 4
ViTPose Collection Collection of ViTPose models based on the Transformers implementation. • 10 items • Updated Jan 12 • 13
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 8 days ago • 258