TheDenk commited on
Commit
6053f46
·
verified ·
1 Parent(s): 4b04a94

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +98 -98
README.md CHANGED
@@ -1,98 +1,98 @@
1
- ---
2
- license: apache-2.0
3
- language:
4
- - en
5
- tags:
6
- - video
7
- - video-generation
8
- - video-to-video
9
- - controlnet
10
- - diffusers
11
- ---
12
- # Dilated Controlnet for Wan2.1 (depth)
13
-
14
-
15
- <video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/63fde49f6315a264aba6a7ed/bA5U06VI9Bs6DjmXJ7Z5S.mp4"></video>
16
-
17
- This repo contains the code for dilated controlnet module for Wan2.1 model.
18
- Dilated controlnet has less basic blocks and also has `stride` parameter. For Wan14B model controlnet blocks count = 6 and stride = 4.
19
- See <a href="https://github.com/TheDenk/wan2.1-dilated-controlnet">Github code</a>.
20
-
21
- ### How to
22
- Clone repo
23
- ```bash
24
- git clone https://github.com/TheDenk/wan2.1-dilated-controlnet.git
25
- cd wan2.1-dilated-controlnet
26
- ```
27
-
28
- Create venv
29
- ```bash
30
- python -m venv venv
31
- source venv/bin/activate
32
- ```
33
-
34
- Install requirements
35
- ```bash
36
- pip install -r requirements.txt
37
- ```
38
-
39
- ### Inference examples
40
- #### Inference with cli
41
- ```bash
42
- python -m inference.cli_demo \
43
- --video_path "resources/physical-4.mp4" \
44
- --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
45
- --controlnet_type "depth" \
46
- --controlnet_stride 4 \
47
- --base_model_path Wan-AI/Wan2.1-T2V-14B-Diffusers \
48
- --controlnet_model_path TheDenk/wan2.1-t2v-14b-controlnet-depth-v1
49
- ```
50
-
51
- #### Inference with Gradio
52
- ```bash
53
- python -m inference.gradio_web_demo \
54
- --controlnet_type "depth" \
55
- --base_model_path Wan-AI/Wan2.1-T2V-14B-Diffusers \
56
- --controlnet_model_path TheDenk/wan2.1-t2v-14b-controlnet-depth-v1
57
- ```
58
- #### Detailed Inference
59
- ```bash
60
- python -m inference.cli_demo \
61
- --video_path "resources/physical-4.mp4" \
62
- --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
63
- --controlnet_type "depth" \
64
- --base_model_path Wan-AI/Wan2.1-T2V-14B-Diffusers \
65
- --controlnet_model_path TheDenk/wan2.1-t2v-14b-controlnet-depth-v1 \
66
- --controlnet_weight 0.8 \
67
- --controlnet_guidance_start 0.0 \
68
- --controlnet_guidance_end 0.8 \
69
- --controlnet_stride 4 \
70
- --num_inference_steps 50 \
71
- --guidance_scale 5.0 \
72
- --video_height 480 \
73
- --video_width 832 \
74
- --num_frames 81 \
75
- --negative_prompt "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards" \
76
- --seed 42 \
77
- --out_fps 16 \
78
- --output_path "result.mp4"
79
- ```
80
-
81
-
82
- ## Acknowledgements
83
- Original code and models [Wan2.1](https://github.com/Wan-Video/Wan2.1).
84
-
85
-
86
- ## Citations
87
- ```
88
- @misc{TheDenk,
89
- title={Dilated Controlnet},
90
- author={Karachev Denis},
91
- url={https://github.com/TheDenk/wan2.1-dilated-controlnet},
92
- publisher={Github},
93
- year={2025}
94
- }
95
- ```
96
-
97
- ## Contacts
98
- <p>Issues should be raised directly in the repository. For professional support and recommendations please <a>[email protected]</a>.</p>
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - video
7
+ - video-generation
8
+ - video-to-video
9
+ - controlnet
10
+ - diffusers
11
+ ---
12
+ # Dilated Controlnet for Wan2.1 (depth)
13
+
14
+
15
+ <video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/63fde49f6315a264aba6a7ed/tGBCvJC9Zk44gtJpCoRz4.mp4"></video>
16
+
17
+ This repo contains the code for dilated controlnet module for Wan2.1 model.
18
+ Dilated controlnet has less basic blocks and also has `stride` parameter. For Wan14B model controlnet blocks count = 6 and stride = 4.
19
+ See <a href="https://github.com/TheDenk/wan2.1-dilated-controlnet">Github code</a>.
20
+
21
+ ### How to
22
+ Clone repo
23
+ ```bash
24
+ git clone https://github.com/TheDenk/wan2.1-dilated-controlnet.git
25
+ cd wan2.1-dilated-controlnet
26
+ ```
27
+
28
+ Create venv
29
+ ```bash
30
+ python -m venv venv
31
+ source venv/bin/activate
32
+ ```
33
+
34
+ Install requirements
35
+ ```bash
36
+ pip install -r requirements.txt
37
+ ```
38
+
39
+ ### Inference examples
40
+ #### Inference with cli
41
+ ```bash
42
+ python -m inference.cli_demo \
43
+ --video_path "resources/physical-4.mp4" \
44
+ --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
45
+ --controlnet_type "depth" \
46
+ --controlnet_stride 4 \
47
+ --base_model_path Wan-AI/Wan2.1-T2V-14B-Diffusers \
48
+ --controlnet_model_path TheDenk/wan2.1-t2v-14b-controlnet-depth-v1
49
+ ```
50
+
51
+ #### Inference with Gradio
52
+ ```bash
53
+ python -m inference.gradio_web_demo \
54
+ --controlnet_type "depth" \
55
+ --base_model_path Wan-AI/Wan2.1-T2V-14B-Diffusers \
56
+ --controlnet_model_path TheDenk/wan2.1-t2v-14b-controlnet-depth-v1
57
+ ```
58
+ #### Detailed Inference
59
+ ```bash
60
+ python -m inference.cli_demo \
61
+ --video_path "resources/physical-4.mp4" \
62
+ --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
63
+ --controlnet_type "depth" \
64
+ --base_model_path Wan-AI/Wan2.1-T2V-14B-Diffusers \
65
+ --controlnet_model_path TheDenk/wan2.1-t2v-14b-controlnet-depth-v1 \
66
+ --controlnet_weight 0.8 \
67
+ --controlnet_guidance_start 0.0 \
68
+ --controlnet_guidance_end 0.8 \
69
+ --controlnet_stride 4 \
70
+ --num_inference_steps 50 \
71
+ --guidance_scale 5.0 \
72
+ --video_height 480 \
73
+ --video_width 832 \
74
+ --num_frames 81 \
75
+ --negative_prompt "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards" \
76
+ --seed 42 \
77
+ --out_fps 16 \
78
+ --output_path "result.mp4"
79
+ ```
80
+
81
+
82
+ ## Acknowledgements
83
+ Original code and models [Wan2.1](https://github.com/Wan-Video/Wan2.1).
84
+
85
+
86
+ ## Citations
87
+ ```
88
+ @misc{TheDenk,
89
+ title={Dilated Controlnet},
90
+ author={Karachev Denis},
91
+ url={https://github.com/TheDenk/wan2.1-dilated-controlnet},
92
+ publisher={Github},
93
+ year={2025}
94
+ }
95
+ ```
96
+
97
+ ## Contacts
98
+ <p>Issues should be raised directly in the repository. For professional support and recommendations please <a>[email protected]</a>.</p>