test1 #26, opened by battleman0526
- README.md +1 -18
- transformer/config.json +1 -1
README.md
CHANGED
@@ -15,11 +15,6 @@ library_name: diffusers
 
 <span style="color: #FF5733; font-weight: bold">For more features and to experience the full capabilities of our product, please visit [https://vivago.ai/](https://vivago.ai/).</span>
 
-## Project Updates
-- 🚀 **July 16, 2025**: We've open-sourced the updated image editing model [**HiDream-E1.1**](https://huggingface.co/HiDream-ai/HiDream-E1-1).
-- 🚀 **May 28, 2025**: We've released our technical report [HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer](https://arxiv.org/abs/2505.22705).
-- 🚀 **April 28, 2025**: We've open-sourced the instruction-based-image-editing model [**HiDream-E1-Full**](https://github.com/HiDream-ai/HiDream-E1). Experience at [https://huggingface.co/spaces/HiDream-ai/HiDream-E1-Full](https://huggingface.co/spaces/HiDream-ai/HiDream-E1-Full)!.
-
 ## Key Features
 - ✨ **Superior Image Quality** - Produces exceptional results across multiple styles including photorealistic, cartoon, artistic, and more. Achieves state-of-the-art HPS v2.1 score, which aligns with human preferences.
 - 🎯 **Best-in-Class Prompt Following** - Achieves industry-leading scores on GenEval and DPG benchmarks, outperforming all other open-source models.
@@ -107,16 +102,4 @@ The Transformer models in this repository are licensed under the MIT License. Th
 
 ## Acknowledgements
 - The VAE component is from `FLUX.1 [schnell]`, licensed under Apache 2.0.
-- The text encoders are from `google/t5-v1_1-xxl` (licensed under Apache 2.0) and `meta-llama/Meta-Llama-3.1-8B-Instruct` (licensed under the Llama 3.1 Community License Agreement).
-
-
-## Citation
-
-```bibtex
-@article{hidreami1technicalreport,
-title={HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer},
-author={Cai, Qi and Chen, Jingwen and Chen, Yang and Li, Yehao and Long, Fuchen and Pan, Yingwei and Qiu, Zhaofan and Zhang, Yiheng and Gao, Fengbin and Xu, Peihan and others},
-journal={arXiv preprint arXiv:2505.22705},
-year={2025}
-}
-```
+- The text encoders are from `google/t5-v1_1-xxl` (licensed under Apache 2.0) and `meta-llama/Meta-Llama-3.1-8B-Instruct` (licensed under the Llama 3.1 Community License Agreement).
transformer/config.json
CHANGED
@@ -1,5 +1,5 @@
 {
-"_class_name": "
+"_class_name": "HiImageTransformer2DModel",
 "_diffusers_version": "0.32.1",
 "attention_head_dim": 128,
 "axes_dims_rope": [
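For reviewers who want to confirm what the `transformer/config.json` change does, the following is a minimal sketch, assuming only the fields visible in the diff. It parses a config string and reads back `_class_name`. The helper name `read_class_name` and the `sample` fragment are hypothetical and exist only for illustration. Whether the installed `diffusers` version can resolve this class name is not stated in the PR and is not checked here.

```python
import json


def read_class_name(config_text: str) -> str:
    """Return the `_class_name` field from a config.json-style string.

    Hypothetical helper for illustration; it only parses JSON and does
    not load any model or import diffusers.
    """
    config = json.loads(config_text)
    return config["_class_name"]


# Hypothetical fragment built from the three scalar fields shown in the diff.
# The real file contains more keys (for example `axes_dims_rope`), omitted here.
sample = json.dumps(
    {
        "_class_name": "HiImageTransformer2DModel",
        "_diffusers_version": "0.32.1",
        "attention_head_dim": 128,
    }
)

print(read_class_name(sample))
```

To check a local checkout instead, read the file contents with `open(...).read()` and pass them to the same helper.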