Remade commited on
Commit
f33f967
·
verified ·
1 Parent(s): 7dbb285

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +122 -0
README.md ADDED
@@ -0,0 +1,122 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ base_model:
6
+ - Wan-AI/Wan2.1-T2V-14B
7
+ pipeline_tag: text-to-video
8
+ tags:
9
+ - text-to-video
10
+ - text-to-image
11
+ - lora
12
+ - diffusers
13
+ - template:diffusion-lora
14
+ widget:
15
+ - text: >-
16
+ The video shows a [z00m_ca11] with four participants. In the top left box, a medieval knight in full armor adjusts his helmet. To his right, a pirate with a parrot on his shoulder drinks from a mug. In the bottom left, a scientist in a lab coat scribbles on a whiteboard. In the bottom right, an alien in a suit waves awkwardly.
17
+ output:
18
+ url: example_videos/zoom1.mp4
19
+ - text: >-
20
+ The video shows a [z00m_ca11] with three participants. In the top left box, a centaur in business attire is seated at a large wooden desk. The top right box shows a wizard with a long beard reviewing spreadsheets. The bottom box shows a velociraptor wearing glasses, sipping coffee and nodding seriously.
21
+ output:
22
+ url: example_videos/zoom2.mp4
23
+ - text: >-
24
+ The video shows a [z00m_ca11] with four participants. In the top left, a chef covered in flour frantically checks a recipe. To the right, a yoga instructor sits calmly with candles lit. The bottom left shows a DJ with headphones bobbing their head. The bottom right shows a firefighter in full gear, sipping coffee.
25
+ output:
26
+ url: example_videos/zoom3.mp4
27
+ - text: >-
28
+ The video shows a [z00m_ca11] with three participants in a 3x3 grid formation. The first person in the top left is a cat wearing glasses, sitting in front of a computer. The second person has a hood and looks down. The third person is a dog wearing a tie, attentively watching the screen.​
29
+ output:
30
+ url: example_videos/zoom4.mp4
31
+ ---
32
+
33
+ <div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
34
+ <h1 style="color: #24292e; margin-top: 0;">360 Degree Rotation Effect LoRA for Wan2.1 14B I2V 480p</h1>
35
+
36
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
37
+ <h2 style="color: #24292e; margin-top: 0;">Overview</h2>
38
+ <p>This LoRA is trained on the Wan2.1 14B I2V 480p model and allows you to rotate any object in an image. The effect works on a wide variety of objects, from animals to vehicles to people!</p>
39
+ </div>
40
+
41
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
42
+ <h2 style="color: #24292e; margin-top: 0;">Features</h2>
43
+ <ul style="margin-bottom: 0;">
44
+ <li>Transform any image into a video of it being rotated</li>
45
+ <li>Trained on the Wan2.1 14B 480p I2V base model</li>
46
+ <li>Consistent results across different object types</li>
47
+ <li>Simple prompt structure that's easy to adapt</li>
48
+ </ul>
49
+ </div>
50
+
51
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
52
+ <h2 style="color: #24292e; margin-top: 0;">Community</h2>
53
+ <ul style="margin-bottom: 0;">
54
+ <li><b>Discord:</b> <a href="https://discord.com/invite/7tsKMCbNFC" style="color: #0366d6; text-decoration: none;">Join our community</a> to generate videos with this LoRA for free</li>
55
+ <li><b>Request LoRAs:</b> We're training and open-sourcing Wan2.1 LoRAs for free - join our Discord to make requests!</li>
56
+ </ul>
57
+ </div>
58
+ </div>
59
+
60
+ <Gallery />
61
+
62
+ # Model File and Inference Workflow
63
+
64
+ ## 📥 Download Links:
65
+
66
+ - [rotate_20_epochs.safetensors](./rotate_20_epochs.safetensors) - LoRA Model File
67
+ - [wan_img2vid_lora_workflow.json](./workflow/wan_img2vid_lora_workflow.json) - Wan I2V with LoRA Workflow for ComfyUI
68
+
69
+ ---
70
+ <div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
71
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
72
+ <h2 style="color: #24292e; margin-top: 0;">Recommended Settings</h2>
73
+ <ul style="margin-bottom: 0;">
74
+ <li><b>LoRA Strength:</b> 1.0</li>
75
+ <li><b>Embedded Guidance Scale:</b> 6.0</li>
76
+ <li><b>Flow Shift:</b> 5.0</li>
77
+ </ul>
78
+ </div>
79
+
80
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
81
+ <h2 style="color: #24292e; margin-top: 0;">Trigger Words</h2>
82
+ <p>The key trigger phrase is: <code style="background-color: #f0f0f0; padding: 3px 6px; border-radius: 4px;">r0t4tion 360 degrees rotation</code></p>
83
+ </div>
84
+
85
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
86
+ <h2 style="color: #24292e; margin-top: 0;">Prompt Template</h2>
87
+ <p>For prompting, check out the example prompts; this way of prompting seems to work very well.</p>
88
+
89
+
90
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
91
+ <h2 style="color: #24292e; margin-top: 0;">ComfyUI Workflow</h2>
92
+ <p>This LoRA works with a modified version of <a href="https://github.com/kijai/ComfyUI-WanVideoWrapper/blob/main/example_workflows/wanvideo_480p_I2V_example_02.json" style="color: #0366d6; text-decoration: none;">Kijai's Wan Video Wrapper workflow</a>. The main modification is adding a Wan LoRA node connected to the base model.</p>
93
+ <img src="./workflow/workflow_screenshot.png" style="width: 100%; border-radius: 8px; margin: 15px 0; box-shadow: 0 4px 8px rgba(0,0,0,0.1);">
94
+ <p>See the Downloads section above for the modified workflow.</p>
95
+ </div>
96
+ </div>
97
+
98
+ <div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
99
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
100
+ <h2 style="color: #24292e; margin-top: 0;">Model Information</h2>
101
+ <p>The model weights are available in Safetensors format. See the Downloads section above.</p>
102
+ </div>
103
+
104
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
105
+ <h2 style="color: #24292e; margin-top: 0;">Training Details</h2>
106
+ <ul style="margin-bottom: 0;">
107
+ <li><b>Base Model:</b> Wan2.1 14B I2V 480p</li>
108
+ <li><b>Training Data:</b> Trained on 30 seconds of video comprised of 12 short clips (each clip captioned separately) of things being rotated</li>
109
+ <li><b> Epochs:</b> 20</li>
110
+ </ul>
111
+ </div>
112
+
113
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
114
+ <h2 style="color: #24292e; margin-top: 0;">Additional Information</h2>
115
+ <p>Training was done using <a href="https://github.com/tdrussell/diffusion-pipe" style="color: #0366d6; text-decoration: none;">Diffusion Pipe for Training</a></p>
116
+ </div>
117
+
118
+ <div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
119
+ <h2 style="color: #24292e; margin-top: 0;">Acknowledgments</h2>
120
+ <p style="margin-bottom: 0;">Special thanks to Kijai for the ComfyUI Wan Video Wrapper and tdrussell for the training scripts!</p>
121
+ </div>
122
+ </div>