base_model:
  - Wan-AI/Wan2.1-I2V-14B-480P
  - Wan-AI/Wan2.1-I2V-14B-480P-Diffusers
datasets:
  - finetrainers/3dgs-dissolve
library_name: diffusers
license: other
license_link: https://huggingface.co/Wan-AI/Wan2.1-I2V-14B-480P/blob/main/LICENSE.txt
widget:
  - text: >-
      3DGS_DISSOLVE A vibrant green Mustang GT parked in a parking lot. The car
      is positioned at an angle, showcasing its sleek design and black rims. The
      car's hood is black, contrasting with the green body. The car gradually
      transforms and bursts into red sparks, creating a dramatic and dynamic
      visual effect against a dark backdrop.
    output:
      url: validation-909-0-2-3DGS_DISSOLVE-A-cooking-t-1745185487.mp4
  - text: >-
      3DGS_DISSOLVE A cooking tutorial featuring a man in a kitchen. He is
      wearing a white t-shirt and a black apron. As the scene progresses, light
      starts to emanate from the man and he burst into a fiery flame of red
      sparks.
    output:
      url: validation-909-0-2-3DGS_DISSOLVE-A-man-in-a--1745185894.mp4
  - text: >-
      3DGS_DISSOLVE A man in a suit and tie, standing against a blue background
      with a digital pattern. He appears to be speaking or presenting, as
      suggested by his open mouth and focused expression. Suddenly, the man
      starts to dissolve into thin air with a bright fiery flame of red sparks.
    output:
      url: validation-909-0-2-3DGS_DISSOLVE-A-man-in-a--1745186300.mp4
  - text: >-
      3DGS_DISSOLVE A man in a workshop, dressed in a black shirt and a beige
      hat, with a beard and glasses. He is holding a hammer and a metal object,
      possibly a piece of iron or a tool. The scene erupts with a bright fiery
      flame of red sparks.
    output:
      url: validation-909-0-2-3DGS_DISSOLVE-A-vibrant-g-1745185082.mp4
tags:
  - text-to-video
  - image-to-video
  - diffusers-training
  - diffusers
  - template:sd-lora
  - wan

This is a LoRA fine-tune of the Wan-AI/Wan2.1-I2V-14B-480P-Diffusers model on the finetrainers/3dgs-dissolve dataset.
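The example prompts for this checkpoint all begin with `3DGS_DISSOLVE`, which suggests the token acts as a trigger for the dissolve effect. That reading is an assumption drawn from the prompts, not something documented here. Under that assumption, a small helper (the name `with_trigger` is hypothetical) can make sure a prompt carries the token exactly once:

```python
# Assumption: "3DGS_DISSOLVE" is the trigger token, inferred from the example prompts.
TRIGGER = "3DGS_DISSOLVE"


def with_trigger(prompt: str, trigger: str = TRIGGER) -> str:
    """Return the prompt prefixed with the trigger token, without duplicating it."""
    prompt = prompt.strip()
    if prompt.startswith(trigger):
        return prompt
    return f"{trigger} {prompt}"
```

The result can be passed as the prompt argument of the pipeline call in the inference code.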

Code: https://github.com/a-r-r-o-w/finetrainers

This is an experimental checkpoint and is known to generalize poorly.

Inference code:

```python
import torch
from diffusers import WanImageToVideoPipeline
from diffusers.utils import export_to_video, load_image

# Load the base image-to-video pipeline in bfloat16 and move it to the GPU.
pipe = WanImageToVideoPipeline.from_pretrained(
    "Wan-AI/Wan2.1-I2V-14B-480P-Diffusers", torch_dtype=torch.bfloat16
).to("cuda")

# Attach the LoRA weights and apply them at a scale of 0.9.
pipe.load_lora_weights("finetrainers/Wan2.1-I2V-14B-480P-3dgs-v0", adapter_name="wan-lora")
pipe.set_adapters(["wan-lora"], [0.9])

# Replace the placeholders with your own conditioning image and prompt.
image = load_image("<URL_OR_PATH>")
video = pipe("<my-awesome-prompt>", image=image).frames[0]
export_to_video(video, "output.mp4", fps=24)
```
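The inference code passes the conditioning image at whatever size it was loaded. Since the base model targets 480P, it may help to resize the image first so that it keeps its aspect ratio, fits a 480P pixel budget, and has sides divisible by the model's spatial stride. The sketch below is not part of the original recipe: the helper name `fit_to_area`, the `480 * 832` budget, and the multiple of 16 (assumed to be VAE downscale 8 times patch size 2) are assumptions to verify against the pipeline you load.

```python
import math


def fit_to_area(width: int, height: int, max_area: int = 480 * 832, multiple: int = 16):
    """Return (new_width, new_height) close to max_area pixels.

    Keeps the aspect ratio approximately and rounds each side down to a
    multiple of `multiple`. Both defaults are assumptions, not values
    taken from this model card.
    """
    aspect = height / width
    new_height = round(math.sqrt(max_area * aspect)) // multiple * multiple
    new_width = round(math.sqrt(max_area / aspect)) // multiple * multiple
    return new_width, new_height
```

With a PIL image, this could be used as `image = image.resize(fit_to_area(image.width, image.height))`, passing the resulting `height` and `width` to the pipeline call as well.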

Training logs are available on WandB.