Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
MonsterMMORPG 
posted an update 3 days ago
Post
2683
WAN 2.1 FusionX + Self Forcing LoRA are the New Best of Local Video Generation with Only 8 Steps + FLUX Upscaling Guide : https://www.youtube.com/watch?v=Xbn93GRQKsQ

Tutorial : https://www.youtube.com/watch?v=Xbn93GRQKsQ

Video Chapters

0:00 Introduction to the New FusionX Video Model & FLUX Upscaling
0:30 One-Click Presets & The SwarmUI Model Downloader Explained
1:07 Achieving Hyper-Realism with the FLUX 2x Latent Upscale Preset
1:58 How to Download & Install the SwarmUI Model Downloader
2:49 Downloading Full Models vs. Downloading Just The LoRAs
3:48 Final Setup: Updating SwarmUI & Importing The New Presets
4:32 Generating a Video: Applying the FusionX Image-to-Video Preset
5:03 Critical Step: Correcting The Model's Native Resolution Metadata
5:55 Finalizing Image-to-Video Settings (Frame Count & RIFE Interpolation)
6:49 Troubleshooting Performance: Identifying Low GPU Usage & Shared VRAM Bug
8:35 The Solution: Disabling Sage Attention for Image-to-Video Models
10:02 Final Result: Showcasing The Amazing HD Quality Animation
10:40 How to Use the FusionX Text-to-Video Model with Presets
11:49 Text-to-Video Result & Quality Comparison
12:08 How to Use the FusionX LoRA with the Base Wan 2.1 Model
13:07 FLUX Tutorial: Downloading The Required Upscaler & Face Models
13:48 Generating a High-Quality Image with The Official FLUX Preset
14:50 Using Automatic Face Segmentation & Inpainting with FLUX
16:05 The Ultimate Upgrade: Applying The FLUX 2x Latent Upscaler Preset
16:32 Final Result: Comparing Standard vs. 2x Upscaled Image Quality
16:50 Outro & Sneak Peek of The New Ultimate Video Processing App

How are you doing the audio sync in the dragon and Ratatouille videos? Amazing work, thanks for sharing!

·

for Audio gen currently best local option is MMAudio : https://youtu.be/504f8S4MLTw

Wow wow wow my man said hold my gpus and killed that shit congrats this is awesome work!!!

·

It's been an amazing couple weeks for me with this stuff. Went from 15-20 minute generations at like 480x480, to suddenly doing 1280x720 in like 6-7 minutes thanks to all this lora business and NAG and RIFE.
I've currently been sniffing around flux, hoping I can run a larger quant of it as well with the same black magic that i've been using with Wan.
ENqydBxWkAAcU7g.jpg

·

yep we getting better