Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published Apr 17 • 52
FramePack Video Generation Collection fast & compact video generation with FramePack - a next-frame prediction neural network structure that generates videos progressively • 9 items • Updated 30 days ago • 6
AI Tools for Art - March '25 Collection Tools & models from the 3rd issue of AI Tools for Art 🎉 read more: https://open.substack.com/pub/multimodalaiart • 17 items • Updated 30 days ago • 2
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published Mar 12 • 40
Shot categorizer Collection Fine-tune of Florence-2 to generate shot categories, useful for data curation. Code: https://github.com/huggingface/movie-shot-categorizer. • 3 items • Updated Mar 6 • 2
view article Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 304
video-effects datasets Collection Smol datasets to emulate cool video effects like "squish", "dissolve", etc. Inspired by Pika effects. • 4 items • Updated Jan 28 • 4
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget Paper • 2407.15811 • Published Jul 22, 2024 • 2
AI Tools for Art - Feb '25 Collection Tools & models from the 2nd issue of AI Tools for Art 🎉 Read more about February's releases: https://open.substack.com/pub/multimodalaiart • 18 items • Updated 30 days ago • 1
Remote VAE Inference Endpoints Collection Models and handler code used in https://huggingface.co/blog/remote_vae • 5 items • Updated Mar 10 • 6
AnyText2: Visual Text Generation and Editing With Customizable Attributes Paper • 2411.15245 • Published Nov 22, 2024 • 1