martintomov
/

InfiniteZoom-Mochi

Model card Files Files and versions Community

InfiniteZoom-Mochi / README.md

martintomov's picture

dataset clarification

d83f813 verified 5 months ago

|

1.11 kB

	---
	license: apache-2.0
	base_model:
	- genmo/mochi-1-preview
	pipeline_tag: text-to-video
	tags:
	- infinite zoom
	- art style
	- mochi
	- diffusion
	widget:
	- text: Human fingers pinching to zoom on an infinite zoom canvas, a detailed cityscape at night, illuminated by neon lights and bustling with activity. The zoom focuses on a lit billboard advertising a soda can, transitioning into the sparkling surface of the liquid. As the zoom deepens, microscopic bubbles transform into entire ecosystems of floating islands within the soda.
	output:
	url: 0.mp4
	---

	# Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi

	This project demonstrates the fine-tuning of the Mochi Text-to-Video model using a LoRA (Low-Rank Adaptation) approach, focusing on the infinite zoom art style.

	## Training Details

	- Model Base: [genmo/mochi-1-preview](https://huggingface.co/genmo/mochi-1-preview)
	- Fine-Tuning Dataset: 23 short video clips of infinite zoom art style, and .txt descriptions
	- Training Settings :: 37 frames
	- Training Hardware: H100 GPU
	- Training Duration: 2h

	---

	<Gallery />