Anime Bright Landscape Text-to-Video Generation
This repository contains the necessary steps and scripts to generate anime-style videos using the Anime_Landscape text-to-video model with LoRA (Low-Rank Adaptation) weights. The model produces anime-style videos based on textual prompts with distinctive geometric and neon aesthetic.
Prerequisites
Before proceeding, ensure that you have the following installed on your system:
• Ubuntu (or a compatible Linux distribution) • Python 3.x • pip (Python package manager) • Git • Git LFS (Git Large File Storage) • FFmpeg
Installation
Update and Install Dependencies
sudo apt-get update && sudo apt-get install cbm git-lfs ffmpeg
Clone the Repository
git clone https://huggingface.co/svjack/Anime_Bright_Landscape_wan_2_1_14_B_text2video_lora cd Anime_Bright_Landscape_wan_2_1_14_B_text2video_lora
Install Python Dependencies
pip install torch torchvision pip install -r requirements.txt pip install ascii-magic matplotlib tensorboard huggingface_hub datasets pip install moviepy==1.0.3 pip install sageattention==1.0.6
Download Model Weights
wget https://huggingface.co/Wan-AI/Wan2.1-T2V-14B/resolve/main/models_t5_umt5-xxl-enc-bf16.pth wget https://huggingface.co/DeepBeepMeep/Wan2.1/resolve/main/models_clip_open-clip-xlm-roberta-large-vit-huge-14.pth wget https://huggingface.co/Wan-AI/Wan2.1-T2V-14B/resolve/main/Wan2.1_VAE.pth wget https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_t2v_1.3B_bf16.safetensors wget https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_t2v_14B_bf16.safetensors
Usage
To generate a video, use the wan_generate_video.py
script with the appropriate parameters.
Interactive Mode
For experimenting with different prompts:
python wan_generate_video.py --fp8 --task t2v-14B --video_size 480 832 --video_length 81 --infer_steps 35 \
--save_path save --output_type both \
--dit wan2.1_t2v_14B_bf16.safetensors --vae Wan2.1_VAE.pth \
--t5 models_t5_umt5-xxl-enc-bf16.pth \
--attn_mode torch \
--lora_weight ani_bright_landscape_w14_outputs/ani_bright_landscape_w14_lora-step00005500.safetensors \
--lora_multiplier 1.0 \
--interactive
anime style ,a digital illustration video about ,high quality nature video featuring a red panda balancing on a bamboo stem while a bird lands on it's head, on the background there is a waterfall
anime style ,a digital illustration video about A cat and a dog baking a cake together in a kitchen. The cat is carefully measuring flour, while the dog is stirring the batter with a wooden spoon. The kitchen is cozy, with sunlight streaming through the window.
anime style ,This vibrant digital illustration video depicts a colorful, orange-and-blue vintage train traveling along a railway track. The train is surrounded by a stunning display of pink cherry blossom trees in full bloom, their petals gently falling onto the tracks and a reflective puddle in the foreground. The bright blue sky, dotted with fluffy white clouds, enhances the lively atmosphere. The train's reflection in the puddle adds depth to the scene. On the left, a wooden utility pole stands next to a grassy area with a simple wire fence. The overall style is whimsical and anime-inspired, with vivid colors and detailed textures.
anime style ,This digital illustration video in a vibrant, anime-inspired style depicts a vintage green tram with "P.R. 13" displayed on its front, traveling down a wet, reflective railway track during a stunning sunset. The sky is ablaze with vivid orange, pink, and purple clouds, casting a warm glow over the scene. On the left, a streetlight and power poles line the track, while small buildings and a few trees are visible on the right. The tram's headlights and windows reflect the colorful sky, and the tracks glisten with rain. The overall mood is nostalgic and serene, capturing a picturesque urban sunset.
anime style, a digital illustration video about ,Sunlight filters through clouds onto a cherry blossom-filled meadow, where a blue-haired youth in an indigo coat stands by a vermilion bridge, holding a book as petals drift past his smile. The scene blends dynamic nature with human stillness, using pink, red and blue hues to create harmony, while symbolic elements like blossoms and books add depth.
- use wan 14b t2v
- use wan fusionX 14b
Key Parameters
--fp8
: Enable FP8 precision (recommended)--task
: Model version (t2v-1.3B
)--video_size
: Output resolution (e.g.,480 832
)--video_length
: Number of frames (typically 81)--infer_steps
: Quality vs speed trade-off (35-50)--lora_weight
: Path to Kinich LoRA weights--lora_multiplier
: Strength of LoRA effect (1.0 recommended)--prompt
: Should include "In the style of Kinich" for best results
Style Characteristics
For optimal results, prompts should describe:
- Characters with geometric neon hair patterns
- Black outfits with gold/teal designs
- Futuristic or high-energy backgrounds
- Vibrant color palettes with glowing elements
- Dynamic poses and expressions
Output
Generated videos and frames will be saved in the specified save_path
directory with:
- MP4 video file
- Individual frames as PNG images
Troubleshooting
• Verify all model weights are correctly downloaded • Ensure sufficient GPU memory (>=12GB recommended) • Check for version conflicts in Python packages
License
This project is licensed under the MIT License.
Acknowledgments
• Hugging Face for model hosting • Wan-AI for base models • svjack for LoRA adaptation
For support, please open an issue in the repository.