Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,64 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model:
|
3 |
+
- Wan-AI/Wan2.1-I2V-14B-480P
|
4 |
+
- Wan-AI/Wan2.1-I2V-14B-480P-Diffusers
|
5 |
+
datasets:
|
6 |
+
- finetrainers/3dgs-dissolve
|
7 |
+
library_name: diffusers
|
8 |
+
license: other
|
9 |
+
license_link: https://huggingface.co/Wan-AI/Wan2.1-I2V-14B-480P/blob/main/LICENSE.txt
|
10 |
+
widget:
|
11 |
+
- text: >-
|
12 |
+
3DGS_DISSOLVE A vibrant green Mustang GT parked in a parking lot. The car is positioned at an angle, showcasing its sleek design and black rims. The car's hood is black, contrasting with the green body. The car gradually transforms and bursts into red sparks, creating a dramatic and dynamic visual effect against a dark backdrop.
|
13 |
+
output:
|
14 |
+
url: validation-909-0-2-3DGS_DISSOLVE-A-cooking-t-1745185487.mp4
|
15 |
+
- text: >-
|
16 |
+
3DGS_DISSOLVE A cooking tutorial featuring a man in a kitchen. He is wearing a white t-shirt and a black apron. As the scene progresses, light starts to emanate from the man and he burst into a fiery flame of red sparks.
|
17 |
+
output:
|
18 |
+
url: validation-909-0-2-3DGS_DISSOLVE-A-man-in-a--1745185894.mp4
|
19 |
+
- text: >-
|
20 |
+
3DGS_DISSOLVE A man in a suit and tie, standing against a blue background with a digital pattern. He appears to be speaking or presenting, as suggested by his open mouth and focused expression. Suddenly, the man starts to dissolve into thin air with a bright fiery flame of red sparks.
|
21 |
+
output:
|
22 |
+
url: validation-909-0-2-3DGS_DISSOLVE-A-man-in-a--1745186300.mp4
|
23 |
+
- text: >-
|
24 |
+
3DGS_DISSOLVE A man in a workshop, dressed in a black shirt and a beige hat, with a beard and glasses. He is holding a hammer and a metal object, possibly a piece of iron or a tool. The scene erupts with a bright fiery flame of red sparks.
|
25 |
+
output:
|
26 |
+
url: validation-909-0-2-3DGS_DISSOLVE-A-vibrant-g-1745185082.mp4
|
27 |
+
tags:
|
28 |
+
- text-to-video
|
29 |
+
- image-to-video
|
30 |
+
- diffusers-training
|
31 |
+
- diffusers
|
32 |
+
- template:sd-lora
|
33 |
+
- wan
|
34 |
+
---
|
35 |
+
|
36 |
+
<Gallery />
|
37 |
+
|
38 |
+
This is a LoRA fine-tune of the [Wan-AI/Wan2.1-I2V-14B-480P-Diffusers](https://huggingface.co/Wan-AI/Wan2.1-I2V-14B-480P-Diffusers) model on the
|
39 |
+
[finetrainers/3dgs-dissolve](https://huggingface.co/datasets/finetrainers/3dgs-dissolve) dataset.
|
40 |
+
|
41 |
+
Code: https://github.com/a-r-r-o-w/finetrainers
|
42 |
+
|
43 |
+
> [!IMPORTANT]
|
44 |
+
> This is an experimental checkpoint and its poor generalization is well-known.
|
45 |
+
|
46 |
+
Inference code:
|
47 |
+
|
48 |
+
```python
|
49 |
+
import torch
|
50 |
+
from diffusers import WanImageToVideoPipeline
|
51 |
+
from diffusers.utils import export_to_video, load_image
|
52 |
+
|
53 |
+
pipe = WanImageToVideoPipeline.from_pretrained(
|
54 |
+
"Wan-AI/Wan2.1-I2V-14B-480P-Diffusers", torch_dtype=torch.bfloat16
|
55 |
+
).to("cuda")
|
56 |
+
pipe.load_lora_weights("finetrainers/Wan2.1-I2V-14B-480P-3dgs-v0", adapter_name="wan-lora")
|
57 |
+
pipe.set_adapters(["wan-lora"], [0.9])
|
58 |
+
|
59 |
+
image = load_image("<URL_OR_PATH>")
|
60 |
+
video = pipe("<my-awesome-prompt>", image=<image>).frames[0]
|
61 |
+
export_to_video(video, "output.mp4", fps=24)
|
62 |
+
```
|
63 |
+
|
64 |
+
Training logs are available on WandB [here](https://wandb.ai/aryanvs/finetrainers-wan-i2v).
|