Wan 2.1 Ultra Advanced Gradio APP for - Works as low as 4GB VRAM - 1-Click Installers for Windows, RunPod, Massed Compute - Batch Processing - T2V - I2V - V2V

#3
by MonsterMMORPG - opened

First of all hats of never seen such a good public model. This model is FLUX impact of video generation models.

Installer and APP : https://www.patreon.com/posts/123105403

Installs into Python 3.10 VENV via pip

I have been working 14 hours today to make this APP before sleeping for you guys :)

We have all the features of Wan 2.1 model

Text to Video 1.3B (as low as 3.5 GB VRAM) - Really fast - 480x832px or 832x480px

Video to Video 1.3B (as low as 3.5 GB VRAM) - Really fast - 480x832px or 832x480px

Text to Video 14B (as low as 17 GB VRAM) - still may work at below VRAM but slower - 720x1280px or 1280x720px

Image to Video 14B (as low as 17 GB VRAM) - still may work at below VRAM but slower - 720x1280px or 1280x720px

When you analyze the below images

First video is animated from the input image with following prompt

A hooded wraith stands motionless in a torrential downpour, lightning cracking across the stormy sky behind it. Its face is an impenetrable void of darkness beneath the tattered hood. Rain cascades down its ragged, flowing cloak, which appears to disintegrate into wisps of shadow at the edges. The mysterious figure holds an enormous sword of pure energy, crackling with electric blue lightning that pulses and flows through the blade like liquid electricity. The weapon drags slightly on the wet ground, sending ripples of power across the puddles forming at the figure's feet. Three glowing blue gems embedded in its chest pulse in rhythm with the storm's lightning strikes, each flash illuminating the decaying, ancient fabric of its attire. The rain intensifies around the figure, droplets seemingly slowing as they near the dark entity, while forks of lightning repeatedly illuminate its imposing silhouette. The atmosphere grows heavier with each passing moment as the wraith slowly raises its crackling blade, the blue energy intensifying and casting eerie shadows across the ruined landscape.

The cat video is purely 1.3B model generation via below prompt

A cute cat walking gracefully on a lush green grass field, its tail swaying gently as it moves. Close-up, moving camera following the cat's steps.

3.png

4.png

write.png

APP has been update to V3 : https://www.patreon.com/posts/wan-2-1-ultra-as-123105403

26 February 2025 Update

APP updated to v3

You can extract zip file into same folder and use Windows_Update.bat file to update

We have added any custom resolution and preset aspect ratios

Check the newest interface screenshot above

Also we have added check box for Tiled VAE Decode (Disable for 1.3B model for 12GB or more GPUs)

When you disable tiled it will use more VRAM but final step of video decoding will be almost instant

Auto cropping added so you shouldn't get error regarding mismatched input image or video

3.png

Sign up or log in to comment