text-to-img-gen-by-maryem

#29
Files changed (2)
  1. README.md +1 -18
  2. transformer/config.json +1 -1
README.md CHANGED
@@ -15,11 +15,6 @@ library_name: diffusers
  
  <span style="color: #FF5733; font-weight: bold">For more features and to experience the full capabilities of our product, please visit [https://vivago.ai/](https://vivago.ai/).</span>
  
- ## Project Updates
- - 🌟 **July 16, 2025**: We've open-sourced the updated image editing model [**HiDream-E1.1**](https://huggingface.co/HiDream-ai/HiDream-E1-1).
- - 📝 **May 28, 2025**: We've released our technical report [HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer](https://arxiv.org/abs/2505.22705).
- - 🚀 **April 28, 2025**: We've open-sourced the instruction-based-image-editing model [**HiDream-E1-Full**](https://github.com/HiDream-ai/HiDream-E1). Experience at [https://huggingface.co/spaces/HiDream-ai/HiDream-E1-Full](https://huggingface.co/spaces/HiDream-ai/HiDream-E1-Full)!.
-
  ## Key Features
  - ✨ **Superior Image Quality** - Produces exceptional results across multiple styles including photorealistic, cartoon, artistic, and more. Achieves state-of-the-art HPS v2.1 score, which aligns with human preferences.
  - 🎯 **Best-in-Class Prompt Following** - Achieves industry-leading scores on GenEval and DPG benchmarks, outperforming all other open-source models.
@@ -107,16 +102,4 @@ The Transformer models in this repository are licensed under the MIT License. Th
  
  ## Acknowledgements
  - The VAE component is from `FLUX.1 [schnell]`, licensed under Apache 2.0.
- - The text encoders are from `google/t5-v1_1-xxl` (licensed under Apache 2.0) and `meta-llama/Meta-Llama-3.1-8B-Instruct` (licensed under the Llama 3.1 Community License Agreement).
-
-
- ## Citation
-
- ```bibtex
- @article{hidreami1technicalreport,
- title={HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer},
- author={Cai, Qi and Chen, Jingwen and Chen, Yang and Li, Yehao and Long, Fuchen and Pan, Yingwei and Qiu, Zhaofan and Zhang, Yiheng and Gao, Fengbin and Xu, Peihan and others},
- journal={arXiv preprint arXiv:2505.22705},
- year={2025}
- }
- ```
+ - The text encoders are from `google/t5-v1_1-xxl` (licensed under Apache 2.0) and `meta-llama/Meta-Llama-3.1-8B-Instruct` (licensed under the Llama 3.1 Community License Agreement).
transformer/config.json CHANGED
@@ -1,5 +1,5 @@
  {
- "_class_name": "HiDreamImageTransformer2DModel",
+ "_class_name": "HiImageTransformer2DModel",
  "_diffusers_version": "0.32.1",
  "attention_head_dim": 128,
  "axes_dims_rope": [
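
The `transformer/config.json` hunk changes only the `_class_name` value. If the loading code resolves that value by name (an assumption, not something this diff states), a reviewer may want to confirm the new name is one the library actually provides. The sketch below is one way to run that check with the standard library alone; `KNOWN_CLASSES` and `check_class_name` are hypothetical names introduced for illustration, and the allow-list holds only the class name that appears on the removed line.

```python
import json

# Hypothetical allow-list: class names the reviewer expects the loader to provide.
# Only the name from the removed line of the diff is listed here.
KNOWN_CLASSES = {"HiDreamImageTransformer2DModel"}


def check_class_name(config_text: str, known=KNOWN_CLASSES) -> tuple[str, bool]:
    """Return the config's `_class_name` and whether it is in the allow-list."""
    config = json.loads(config_text)
    name = config.get("_class_name", "")
    return name, name in known


# Trimmed copies of the old and new config, keeping only keys shown in the diff.
old_config = (
    '{"_class_name": "HiDreamImageTransformer2DModel", '
    '"_diffusers_version": "0.32.1", "attention_head_dim": 128}'
)
new_config = (
    '{"_class_name": "HiImageTransformer2DModel", '
    '"_diffusers_version": "0.32.1", "attention_head_dim": 128}'
)

for label, text in (("old", old_config), ("new", new_config)):
    name, ok = check_class_name(text)
    print(f"{label}: {name} -> {'known' if ok else 'not in allow-list'}")
```

With this allow-list, the old value passes and the new value is flagged, which is the signal to verify the rename against the installed library before merging.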