TheDenk commited on
Commit
1735ab9
·
verified ·
1 Parent(s): 39b12b7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +100 -99
README.md CHANGED
@@ -1,99 +1,100 @@
1
- ---
2
- license: apache-2.0
3
- language:
4
- - en
5
- tags:
6
- - video
7
- - video-generation
8
- - video-to-video
9
- - controlnet
10
- - diffusers
11
- pipeline_tag: video-to-video
12
- ---
13
- # Dilated Controlnet for Wan2.1 (canny)
14
-
15
-
16
- This repo contains the code for dilated controlnet module for Wan2.1 model.
17
- Dilated controlnet has less basic blocks and also has `stride` parameter. For Wan1.3B model controlnet blocks count = 8 and stride = 3.
18
- See <a href="https://github.com/TheDenk/wan2.1-dilated-controlnet">Github code</a>.
19
- General scheme
20
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63fde49f6315a264aba6a7ed/XPa3l2dm-BhuqyAH_Yk63.png)
21
-
22
- ### How to
23
- Clone repo
24
- ```bash
25
- git clone https://github.com/TheDenk/wan2.1-dilated-controlnet.git
26
- cd wan2.1-dilated-controlnet
27
- ```
28
-
29
- Create venv
30
- ```bash
31
- python -m venv venv
32
- source venv/bin/activate
33
- ```
34
-
35
- Install requirements
36
- ```bash
37
- pip install -r requirements.txt
38
- ```
39
-
40
- ### Inference examples
41
- #### Inference with cli
42
- ```bash
43
- python -m inference.cli_demo \
44
- --video_path "resources/physical-4.mp4" \
45
- --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
46
- --controlnet_type "canny" \
47
- --controlnet_stride 3 \
48
- --base_model_path Wan-AI/Wan2.1-T2V-1.3B-Diffusers \
49
- --controlnet_model_path TheDenk/wan2.1-t2v-1.3b-controlnet-canny-v1
50
- ```
51
-
52
- #### Inference with Gradio
53
- ```bash
54
- python -m inference.gradio_web_demo \
55
- --controlnet_type "canny" \
56
- --base_model_path Wan-AI/Wan2.1-T2V-1.3B-Diffusers \
57
- --controlnet_model_path TheDenk/wan2.1-t2v-1.3b-controlnet-canny-v1
58
- ```
59
- #### Detailed Inference
60
- ```bash
61
- python -m inference.cli_demo \
62
- --video_path "resources/physical-4.mp4" \
63
- --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
64
- --controlnet_type "canny" \
65
- --base_model_path Wan-AI/Wan2.1-T2V-1.3B-Diffusers \
66
- --controlnet_model_path TheDenk/wan2.1-t2v-1.3b-controlnet-canny-v1 \
67
- --controlnet_weight 0.8 \
68
- --controlnet_guidance_start 0.0 \
69
- --controlnet_guidance_end 0.8 \
70
- --controlnet_stride 3 \
71
- --num_inference_steps 50 \
72
- --guidance_scale 5.0 \
73
- --video_height 480 \
74
- --video_width 832 \
75
- --num_frames 81 \
76
- --negative_prompt "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards" \
77
- --seed 42 \
78
- --out_fps 16 \
79
- --output_path "result.mp4"
80
- ```
81
-
82
-
83
- ## Acknowledgements
84
- Original code and models [Wan2.1](https://github.com/Wan-Video/Wan2.1).
85
-
86
-
87
- ## Citations
88
- ```
89
- @misc{TheDenk,
90
- title={Dilated Controlnet},
91
- author={Karachev Denis},
92
- url={https://github.com/TheDenk/wan2.1-dilated-controlnet},
93
- publisher={Github},
94
- year={2025}
95
- }
96
- ```
97
-
98
- ## Contacts
99
- <p>Issues should be raised directly in the repository. For professional support and recommendations please <a>[email protected]</a>.</p>
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - video
7
+ - video-generation
8
+ - video-to-video
9
+ - controlnet
10
+ - diffusers
11
+ pipeline_tag: video-to-video
12
+ ---
13
+ # Dilated Controlnet for Wan2.1 (canny)
14
+
15
+ <video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/63fde49f6315a264aba6a7ed/XHKT6OS-YMMlQR1Jo3ezy.mp4"></video>
16
+
17
+ This repo contains the code for dilated controlnet module for Wan2.1 model.
18
+ Dilated controlnet has less basic blocks and also has `stride` parameter. For Wan1.3B model controlnet blocks count = 8 and stride = 3.
19
+ See <a href="https://github.com/TheDenk/wan2.1-dilated-controlnet">Github code</a>.
20
+ General scheme
21
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63fde49f6315a264aba6a7ed/XPa3l2dm-BhuqyAH_Yk63.png)
22
+
23
+ ### How to
24
+ Clone repo
25
+ ```bash
26
+ git clone https://github.com/TheDenk/wan2.1-dilated-controlnet.git
27
+ cd wan2.1-dilated-controlnet
28
+ ```
29
+
30
+ Create venv
31
+ ```bash
32
+ python -m venv venv
33
+ source venv/bin/activate
34
+ ```
35
+
36
+ Install requirements
37
+ ```bash
38
+ pip install -r requirements.txt
39
+ ```
40
+
41
+ ### Inference examples
42
+ #### Inference with cli
43
+ ```bash
44
+ python -m inference.cli_demo \
45
+ --video_path "resources/physical-4.mp4" \
46
+ --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
47
+ --controlnet_type "canny" \
48
+ --controlnet_stride 3 \
49
+ --base_model_path Wan-AI/Wan2.1-T2V-1.3B-Diffusers \
50
+ --controlnet_model_path TheDenk/wan2.1-t2v-1.3b-controlnet-canny-v1
51
+ ```
52
+
53
+ #### Inference with Gradio
54
+ ```bash
55
+ python -m inference.gradio_web_demo \
56
+ --controlnet_type "canny" \
57
+ --base_model_path Wan-AI/Wan2.1-T2V-1.3B-Diffusers \
58
+ --controlnet_model_path TheDenk/wan2.1-t2v-1.3b-controlnet-canny-v1
59
+ ```
60
+ #### Detailed Inference
61
+ ```bash
62
+ python -m inference.cli_demo \
63
+ --video_path "resources/physical-4.mp4" \
64
+ --prompt "A balloon filled with water was thrown to the ground, exploding and splashing water in all directions. There were graffiti on the wall, studio lighting, and commercial movie shooting." \
65
+ --controlnet_type "canny" \
66
+ --base_model_path Wan-AI/Wan2.1-T2V-1.3B-Diffusers \
67
+ --controlnet_model_path TheDenk/wan2.1-t2v-1.3b-controlnet-canny-v1 \
68
+ --controlnet_weight 0.8 \
69
+ --controlnet_guidance_start 0.0 \
70
+ --controlnet_guidance_end 0.8 \
71
+ --controlnet_stride 3 \
72
+ --num_inference_steps 50 \
73
+ --guidance_scale 5.0 \
74
+ --video_height 480 \
75
+ --video_width 832 \
76
+ --num_frames 81 \
77
+ --negative_prompt "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards" \
78
+ --seed 42 \
79
+ --out_fps 16 \
80
+ --output_path "result.mp4"
81
+ ```
82
+
83
+
84
+ ## Acknowledgements
85
+ Original code and models [Wan2.1](https://github.com/Wan-Video/Wan2.1).
86
+
87
+
88
+ ## Citations
89
+ ```
90
+ @misc{TheDenk,
91
+ title={Dilated Controlnet},
92
+ author={Karachev Denis},
93
+ url={https://github.com/TheDenk/wan2.1-dilated-controlnet},
94
+ publisher={Github},
95
+ year={2025}
96
+ }
97
+ ```
98
+
99
+ ## Contacts
100
+ <p>Issues should be raised directly in the repository. For professional support and recommendations please <a>[email protected]</a>.</p>