Qwen-Image Full Distillation Accelerated Model

Model Introduction

This model is a distilled and accelerated version of Qwen-Image.
The original model requires 40 inference steps and uses classifier-free guidance (CFG), resulting in a total of 80 forward passes.
The distilled accelerated model only requires 15 inference steps and does not need CFG, resulting in only 15 forward passes โ€” achieving about 5ร— speed-up.
Of course, the number of inference steps can be further reduced if needed, but generation quality may decrease.

The training framework is built using DiffSynth-Studio.
The training dataset consists of 16,000 images generated by the original model using randomly sampled prompts from DiffusionDB.
Training was conducted for about 1 day on 8 ร— MI308X GPUs.

Performance Comparison

Original Model Original Model Accelerated Model
Inference Steps 40 15 15
CFG Scale 4 1 1
Forward Passes 80 15 15
Example 1
Example 2
Example 3

Inference Code

git clone https://github.com/modelscope/DiffSynth-Studio.git  
cd DiffSynth-Studio
pip install -e .
from diffsynth.pipelines.qwen_image import QwenImagePipeline, ModelConfig
import torch


pipe = QwenImagePipeline.from_pretrained(
    torch_dtype=torch.bfloat16,
    device="cuda",
    model_configs=[
        ModelConfig(model_id="DiffSynth-Studio/Qwen-Image-Distill-Full", origin_file_pattern="diffusion_pytorch_model*.safetensors"),
        ModelConfig(model_id="Qwen/Qwen-Image", origin_file_pattern="text_encoder/model*.safetensors"),
        ModelConfig(model_id="Qwen/Qwen-Image", origin_file_pattern="vae/diffusion_pytorch_model.safetensors"),
    ],
    tokenizer_config=ModelConfig(model_id="Qwen/Qwen-Image", origin_file_pattern="tokenizer/"),
)
prompt = "Delicate portrait, underwater girl, flowing blue dress, hair floating, clear light and shadows, bubbles surrounding, serene face, exquisite details, dreamy and beautiful."
image = pipe(prompt, seed=0, num_inference_steps=15, cfg_scale=1)
image.save("image.jpg")

license: apache-2.0

Downloads last month
50
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for SahilCarterr/Qwen-Image-Distill-Full

Base model

Qwen/Qwen-Image
Finetuned
(7)
this model