- **Self-Attention-Based Knowledge Distillation**: The core technique in KOALA focuses on the distillation of self-attention features, which proves crucial for maintaining image generation quality.
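The self-attention distillation above can be sketched as matching the student U-Net's attention probability maps against the teacher's. The toy example below uses random features and an MSE objective; the shapes, the loss choice, and the function names are illustrative assumptions, not the authors' implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention_map(q, k):
    # scaled dot-product attention probabilities (tokens x tokens)
    return softmax(q @ k.T / np.sqrt(q.shape[-1]))

rng = np.random.default_rng(0)
tokens, dim = 16, 8

# stand-ins for teacher (large U-Net) and student (compressed U-Net) attention maps
teacher = self_attention_map(rng.normal(size=(tokens, dim)),
                             rng.normal(size=(tokens, dim)))
student = self_attention_map(rng.normal(size=(tokens, dim)),
                             rng.normal(size=(tokens, dim)))

# distillation term: penalize the student for attending differently than the teacher
kd_loss = np.mean((student - teacher) ** 2)
print(f"self-attention KD loss: {kd_loss:.5f}")
```

In training, a term like this would be added to the usual denoising objective so the compressed U-Net inherits where the teacher attends.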
## Model Description

- Developed by [ETRI Visual Intelligence Lab](https://huggingface.co/etri-vilab)
- Developers: [Youngwan Lee](https://youngwanlee.github.io/), [Kwanyong Park](https://pkyong95.github.io/), [Yoorhim Cho](https://ofzlo.github.io/), [Young-Ju Lee](https://scholar.google.com/citations?user=6goOQh8AAAAJ&hl=en), [Sung Ju Hwang](http://www.sungjuhwang.com/)
- Model Description: a latent-diffusion-based text-to-image generative model. KOALA models use the same text encoders as [SDXL-Base-1.0](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0) and replace only the denoising U-Net with compressed U-Nets.
- Resources for more information: the [KOALA report on arXiv](https://arxiv.org/abs/2312.04005) and the [project page](https://youngwanlee.github.io/KOALA/).
## Usage with 🤗[Diffusers library](https://github.com/huggingface/diffusers)

Inference code with 25 denoising steps:

```python
negative = "worst quality, low quality, illustration, low resolution"

image = pipe(prompt=prompt, negative_prompt=negative).images[0]
```
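A note on how `negative_prompt` acts in the snippet above: at every denoising step the U-Net produces one noise estimate conditioned on the prompt and one conditioned on the negative text, and classifier-free guidance extrapolates away from the latter. A toy numpy sketch of that combination step (the numbers are illustrative, and the guidance scale of 5.0 is an assumed default, not taken from this README):

```python
import numpy as np

def cfg_combine(eps_neg, eps_pos, guidance_scale=5.0):
    # classifier-free guidance: move the noise estimate away from the
    # negative/unconditional branch, toward the prompt-conditioned one
    return eps_neg + guidance_scale * (eps_pos - eps_neg)

# stand-ins for the two U-Net noise predictions at one denoising step
eps_neg = np.array([0.1, 0.2])
eps_pos = np.array([0.3, 0.1])

print(cfg_combine(eps_neg, eps_pos))
```

Larger guidance scales follow the prompt (and avoid the negative prompt) more aggressively, at some cost in sample diversity.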

## Uses

### Direct Use

The model is intended for research purposes only. Possible research areas and tasks include:

- Generation of artworks and use in design and other artistic processes.
- Applications in educational or creative tools.
- Research on generative models.
- Safe deployment of models which have the potential to generate harmful content.
- Probing and understanding the limitations and biases of generative models.

Excluded uses are described below.
### Out-of-Scope Use

The model was not trained to produce factual or truthful representations of people or events, so using it to generate such content is out of scope.
## Limitations and Bias

- Text rendering: the models face challenges in rendering long, legible text within images.
- Complex prompts: KOALA sometimes struggles with complex prompts involving multiple attributes.
- Dataset dependencies: these limitations are partially attributable to the characteristics of the training dataset (LAION-Aesthetics-V2 6+).
## Citation

```bibtex
@misc{Lee@koala,