--- license: apache-2.0 language: - en base_model: - Wan-AI/Wan2.1-T2V-14B pipeline_tag: text-to-video tags: - text-to-video - text-to-image - lora - diffusers - template:diffusion-lora widget: - text: >- p0v_dr1v1n6, video shows a person driving a car through a burning hellscape. The driver is holding the steering wheel with both hands. Rivers of lava flow on both sides of the cracked road, and firestorms rage in the distance. The driver is looking straight ahead. The car has a digital dashboard and a touchscreen display flickering with demonic symbols. output: url: example_videos/pov1.mp4 - text: >- p0v_dr1v1n6 through a sandstorm in the desert, visibility dropping as golden dust engulfs the horizon, digital dashboard displaying emergency alerts, the car struggling against the powerful winds. output: url: example_videos/pov2.mp4 - text: >- dr1v12ng POV Driving. The video shows the interior of a car driving down a city street at night. The driver's hands are visible on the steering wheel. The city lights are reflecting in the windshield. output: url: example_videos/pov3.mp4 - text: >- p0v_dr1v1n6, video shows a person driving a car on the surface of the Moon. The driver is holding the steering wheel with both hands. The road is covered in lunar dust, and Earth glows brightly in the sky. The driver is looking straight ahead. The car has a digital dashboard and a touchscreen display output: url: example_videos/pov4.mp4 ---

POV Driving LoRA for Wan2.1 14B T2V

Overview

This LoRA is trained on the Wan2.1 14B T2V model and allows you to generate POV driving videos in any scene or landscape you desire!

Features

Community

# Model File and Inference Workflow ## 📥 Download Links: - [pov_driving_5_epochs.safetensors](./pov_driving_5_epochs.safetensors) - LoRA Model File - [wan_txt2vid_lora_workflow.json](./workflow/wan_txt2vid_lora_workflow.json) - Wan T2V with LoRA Workflow for ComfyUI ---

Recommended Settings

  • LoRA Strength: 1.0
  • Embedded Guidance Scale: 6.0
  • Flow Shift: 5.0

Trigger Words

The key trigger phrase is: p0v_dr1v1n6

Prompt Template

For prompting, check out the example prompts; this way of prompting seems to work very well.

ComfyUI Workflow

This LoRA works with a modified version of Kijai's Wan Video Wrapper workflow. The main modification is adding a Wan LoRA node connected to the base model.

See the Downloads section above for the modified workflow.

Model Information

The model weights are available in Safetensors format. See the Downloads section above.

Training Details

  • Base Model: Wan2.1 14B T2V
  • Training Data: Trained on 17 minutes of video comprised of 204 short clips (each clip captioned separately) of various POV driving footage.
  • Epochs: 5

Additional Information

Training was done using Diffusion Pipe for Training

Acknowledgments

Special thanks to Kijai for the ComfyUI Wan Video Wrapper and tdrussell for the training scripts!