finetrainers
/

CogVideoX-1.5-crush-smol-v0

diffusers-training

cogvideox-diffusers

template:sd-lora

Model card Files Files and versions Community

CogVideoX-1.5-crush-smol-v0 / README.md

a-r-r-o-w's picture

a-r-r-o-w HF Staff

Create README.md

bf174c0 verified about 1 month ago

|

history blame contribute delete

2.35 kB

	---
	base_model:
	- THUDM/CogVideoX1.5-5b
	datasets: finetrainers/crush-smol
	library_name: diffusers
	license: other
	license_link: https://huggingface.co/THUDM/CogVideoX1.5-5b/blob/main/LICENSE
	widget:
	- text: >-
	PIKA_CRUSH A red toy car is being crushed by a large hydraulic press, which is flattening objects as if they were under a hydraulic press.
	output:
	url: final-3000-0-2-PIKA_CRUSH-A-red-toy-car-.mp4
	- text: >-
	PIKA_CRUSH A large metal cylinder is seen pressing down on a pile of colorful jelly beans, flattening them as if they were under a hydraulic press.
	output:
	url: final-3000-0-2-PIKA_CRUSH-A-large-metal-.mp4
	- text: >-
	PIKA_CRUSH A large metal cylinder is seen pressing down on a pile of Oreo cookies, flattening them as if they were under a hydraulic press.
	output:
	url: final-3000-1-2-PIKA_CRUSH-A-large-metal-.mp4
	tags:
	- text-to-video
	- diffusers-training
	- diffusers
	- cogvideox
	- cogvideox-diffusers
	- template:sd-lora
	---

	<Gallery />

	This is a LoRA fine-tune of the [THUDM/CogVideoX1.5-5b](https://huggingface.co/THUDM/CogVideoX1.5-5b) model on the
	[finetrainers/crush-smol](https://huggingface.co/datasets/finetrainers/crush-smol) dataset.

	Code: https://github.com/a-r-r-o-w/finetrainers

	> [!IMPORTANT]
	> This is an experimental checkpoint and its poor generalization is well-known.

	Inference code:

	```py
	from diffusers import CogVideoXTransformer3DModel, DiffusionPipeline
	from diffusers.utils import export_to_video
	import torch

	pipeline = DiffusionPipeline.from_pretrained(
	"THUDM/CogVideoX1.5-5b", torch_dtype=torch.bfloat16
	).to("cuda")
	pipeline.load_lora_weights("finetrainers/CogVideoX-1.5-crush-smol-v0", adapter_name="cogvideox-lora")
	pipeline.set_adapters("cogvideox-lora", 0.9)

	prompt = """
	PIKA_CRUSH A red toy car is being crushed by a large hydraulic press, which is flattening objects as if they were under a hydraulic press.
	"""
	negative_prompt = "inconsistent motion, blurry motion, worse quality, degenerate outputs, deformed outputs"

	video = pipeline(
	prompt=prompt,
	negative_prompt=negative_prompt,
	num_frames=81,
	height=480,
	width=768,
	num_inference_steps=50
	).frames[0]
	export_to_video(video, "output.mp4", fps=25)
	```

	Training logs are available on WandB [here](https://wandb.ai/aryanvs/finetrainers-cogvideox?nw=nwuseraryanvs).