Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ tags:
|
|
9 |
|
10 |
<!-- Provide a quick summary of what the model is/does. -->
|
11 |
This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of \\(\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}\\), enabling 4096x downsampling.
|
12 |
-
It is part of the [Reducio-DiT](https://arxiv.org/abs/
|
13 |
|
14 |
|
15 |
## Model Details
|
@@ -19,7 +19,7 @@ It is part of the [Reducio-DiT](https://arxiv.org/abs/xxxx), which is a video ge
|
|
19 |
<!-- Provide the basic links for the model. -->
|
20 |
|
21 |
- **Repository:** [GitHub Repository](https://github.com/microsoft/Reducio-VAE)
|
22 |
-
- **Paper:** [arXiv](https://arxiv.org/abs/
|
23 |
|
24 |
## Uses
|
25 |
|
@@ -62,7 +62,7 @@ Metrics on 1K Pexels validation set and UCF-101:
|
|
62 |
@article{tian2024reducio,
|
63 |
title={REDUCIO! Generating 1024*1024 Video within 16 Seconds using Extremely Compressed Motion Latents},
|
64 |
author={Tian, Rui and Dai, Qi and Bao, Jianmin and Qiu, Kai and Yang, Yifan and Luo, Chong and Wu, Zuxuan and Jiang, Yu-Gang},
|
65 |
-
journal={arXiv preprint arXiv:
|
66 |
year={2024}
|
67 |
}
|
68 |
```
|
|
|
9 |
|
10 |
<!-- Provide a quick summary of what the model is/does. -->
|
11 |
This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of \\(\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}\\), enabling 4096x downsampling.
|
12 |
+
It is part of the [Reducio-DiT](https://arxiv.org/abs/2411.13552), which is a video generation method. Codebase available [here](https://github.com/microsoft/Reducio-VAE).
|
13 |
|
14 |
|
15 |
## Model Details
|
|
|
19 |
<!-- Provide the basic links for the model. -->
|
20 |
|
21 |
- **Repository:** [GitHub Repository](https://github.com/microsoft/Reducio-VAE)
|
22 |
+
- **Paper:** [arXiv](https://arxiv.org/abs/2411.13552)
|
23 |
|
24 |
## Uses
|
25 |
|
|
|
62 |
@article{tian2024reducio,
|
63 |
title={REDUCIO! Generating 1024*1024 Video within 16 Seconds using Extremely Compressed Motion Latents},
|
64 |
author={Tian, Rui and Dai, Qi and Bao, Jianmin and Qiu, Kai and Yang, Yifan and Luo, Chong and Wu, Zuxuan and Jiang, Yu-Gang},
|
65 |
+
journal={arXiv preprint arXiv:2411.13552},
|
66 |
year={2024}
|
67 |
}
|
68 |
```
|