InfiniteZoom-Mochi / README.md
martintomov's picture
dataset clarification
d83f813 verified
|
raw
history blame
1.11 kB
---
license: apache-2.0
base_model:
- genmo/mochi-1-preview
pipeline_tag: text-to-video
tags:
- infinite zoom
- art style
- mochi
- diffusion
widget:
- text: Human fingers pinching to zoom on an infinite zoom canvas, a detailed cityscape at night, illuminated by neon lights and bustling with activity. The zoom focuses on a lit billboard advertising a soda can, transitioning into the sparkling surface of the liquid. As the zoom deepens, microscopic bubbles transform into entire ecosystems of floating islands within the soda.
output:
url: 0.mp4
---
# Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi
This project demonstrates the fine-tuning of the **Mochi Text-to-Video** model using a LoRA (Low-Rank Adaptation) approach, focusing on the **infinite zoom art style**.
## Training Details
- **Model Base**: [genmo/mochi-1-preview](https://huggingface.co/genmo/mochi-1-preview)
- **Fine-Tuning Dataset**: 23 short video clips of infinite zoom art style, and .txt descriptions
- **Training Settings :**: 37 frames
- **Training Hardware**: H100 GPU
- **Training Duration**: 2h
---
<Gallery />