|
--- |
|
license: apache-2.0 |
|
base_model: |
|
- genmo/mochi-1-preview |
|
pipeline_tag: text-to-video |
|
tags: |
|
- infinite zoom |
|
- art style |
|
- mochi |
|
- diffusion |
|
widget: |
|
- text: Human fingers pinching to zoom on an infinite zoom canvas, a detailed cityscape at night, illuminated by neon lights and bustling with activity. The zoom focuses on a lit billboard advertising a soda can, transitioning into the sparkling surface of the liquid. As the zoom deepens, microscopic bubbles transform into entire ecosystems of floating islands within the soda. |
|
output: |
|
url: 0.mp4 |
|
--- |
|
|
|
# Fine-Tuning Mochi Text-to-Video: InfiniteZoom-Mochi |
|
|
|
This project demonstrates the fine-tuning of the **Mochi Text-to-Video** model using a LoRA (Low-Rank Adaptation) approach, focusing on the **infinite zoom art style**. |
|
|
|
## Training Details |
|
|
|
- **Model Base**: [genmo/mochi-1-preview](https://huggingface.co/genmo/mochi-1-preview) |
|
- **Fine-Tuning Dataset**: 23 short video clips of infinite zoom art style, and .txt descriptions |
|
- **Training Settings :**: 37 frames |
|
- **Training Hardware**: H100 GPU |
|
- **Training Duration**: 2h |
|
|
|
--- |
|
|
|
<Gallery /> |