Anime Bright Landscape Text-to-Video Generation

This repository contains the necessary steps and scripts to generate anime-style videos using the Anime_Landscape text-to-video model with LoRA (Low-Rank Adaptation) weights. The model produces anime-style videos based on textual prompts with distinctive geometric and neon aesthetic.

Prerequisites

Before proceeding, ensure that you have the following installed on your system:

• Ubuntu (or a compatible Linux distribution) • Python 3.x • pip (Python package manager) • Git • Git LFS (Git Large File Storage) • FFmpeg

Installation

Update and Install Dependencies

sudo apt-get update && sudo apt-get install cbm git-lfs ffmpeg

Clone the Repository

git clone https://huggingface.co/svjack/Anime_Bright_Landscape_wan_2_1_14_B_text2video_lora
cd Anime_Bright_Landscape_wan_2_1_14_B_text2video_lora

Install Python Dependencies

pip install torch torchvision
pip install -r requirements.txt
pip install ascii-magic matplotlib tensorboard huggingface_hub datasets
pip install moviepy==1.0.3
pip install sageattention==1.0.6

Download Model Weights

wget https://huggingface.co/Wan-AI/Wan2.1-T2V-14B/resolve/main/models_t5_umt5-xxl-enc-bf16.pth
wget https://huggingface.co/DeepBeepMeep/Wan2.1/resolve/main/models_clip_open-clip-xlm-roberta-large-vit-huge-14.pth
wget https://huggingface.co/Wan-AI/Wan2.1-T2V-14B/resolve/main/Wan2.1_VAE.pth
wget https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_t2v_1.3B_bf16.safetensors
wget https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_t2v_14B_bf16.safetensors

Usage

To generate a video, use the wan_generate_video.py script with the appropriate parameters.

Interactive Mode

For experimenting with different prompts:

python wan_generate_video.py --fp8 --task t2v-14B --video_size 480 832 --video_length 81 --infer_steps 35 \
--save_path save --output_type both \
--dit wan2.1_t2v_14B_bf16.safetensors --vae Wan2.1_VAE.pth \
--t5 models_t5_umt5-xxl-enc-bf16.pth \
--attn_mode torch \
--lora_weight ani_bright_landscape_w14_outputs/ani_bright_landscape_w14_lora-step00005500.safetensors \
--lora_multiplier 1.0 \
--interactive

anime style ,a digital illustration video about ,high quality nature video featuring a red panda balancing on a bamboo stem while a bird lands on it's head, on the background there is a waterfall

anime style ,a digital illustration video about A cat and a dog baking a cake together in a kitchen. The cat is carefully measuring flour, while the dog is stirring the batter with a wooden spoon. The kitchen is cozy, with sunlight streaming through the window.

anime style ,This vibrant digital illustration video depicts a colorful, orange-and-blue vintage train traveling along a railway track. The train is surrounded by a stunning display of pink cherry blossom trees in full bloom, their petals gently falling onto the tracks and a reflective puddle in the foreground. The bright blue sky, dotted with fluffy white clouds, enhances the lively atmosphere. The train's reflection in the puddle adds depth to the scene. On the left, a wooden utility pole stands next to a grassy area with a simple wire fence. The overall style is whimsical and anime-inspired, with vivid colors and detailed textures.

anime style ,This digital illustration video in a vibrant, anime-inspired style depicts a vintage green tram with "P.R. 13" displayed on its front, traveling down a wet, reflective railway track during a stunning sunset. The sky is ablaze with vivid orange, pink, and purple clouds, casting a warm glow over the scene. On the left, a streetlight and power poles line the track, while small buildings and a few trees are visible on the right. The tram's headlights and windows reflect the colorful sky, and the tracks glisten with rain. The overall mood is nostalgic and serene, capturing a picturesque urban sunset.

anime style, a digital illustration video about ,Sunlight filters through clouds onto a cherry blossom-filled meadow, where a blue-haired youth in an indigo coat stands by a vermilion bridge, holding a book as petals drift past his smile. The scene blends dynamic nature with human stillness, using pink, red and blue hues to create harmony, while symbolic elements like blossoms and books add depth.

use wan 14b t2v

use wan fusionX 14b

Key Parameters

--fp8: Enable FP8 precision (recommended)
--task: Model version (t2v-1.3B)
--video_size: Output resolution (e.g., 480 832)
--video_length: Number of frames (typically 81)
--infer_steps: Quality vs speed trade-off (35-50)
--lora_weight: Path to Kinich LoRA weights
--lora_multiplier: Strength of LoRA effect (1.0 recommended)
--prompt: Should include "In the style of Kinich" for best results

Style Characteristics

For optimal results, prompts should describe:

Characters with geometric neon hair patterns
Black outfits with gold/teal designs
Futuristic or high-energy backgrounds
Vibrant color palettes with glowing elements
Dynamic poses and expressions

Output

Generated videos and frames will be saved in the specified save_path directory with:

MP4 video file
Individual frames as PNG images

Troubleshooting

• Verify all model weights are correctly downloaded • Ensure sufficient GPU memory (>=12GB recommended) • Check for version conflicts in Python packages

License

This project is licensed under the MIT License.

Acknowledgments

• Hugging Face for model hosting • Wan-AI for base models • svjack for LoRA adaptation

For support, please open an issue in the repository.