How can I make this model run faster? #78
by Sengil
How can I make this model run faster? It's running too slowly...
import torch
from diffusers import FluxPipeline
pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16) # can replace schnell with dev
# to run on low vram GPUs (i.e. between 4 and 32 GB VRAM)
pipe.enable_sequential_cpu_offload()
pipe.vae.enable_slicing()  # decode batched outputs one image at a time to save VRAM
pipe.vae.enable_tiling()   # decode large images in tiles to save VRAM
pipe.to(torch.float16) # casting here instead of in the pipeline constructor because doing so in the constructor loads all models into CPU memory at once
prompt = f"{ai_msg_prompt.content}"  # ai_msg_prompt is defined elsewhere in the author's code (e.g. an LLM response object)
out = pipe(
    prompt=prompt,
    guidance_scale=0.0,
    height=1024,
    width=1024,
    num_inference_steps=4,
    max_sequence_length=256,
).images[0]
out.show()
out.save("image.png")
Just from the code shown here, you can save some inference time by using FP8. The reduction in quality is well worth it for the increase in speed.
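For example, here is a minimal sketch of FP8 weight quantization using optimum-quanto (the qfloat8 weight type). It assumes optimum-quanto is installed and only quantizes the transformer and the T5 text encoder, which dominate memory and compute; the prompt below is just a placeholder, reuse your own. Depending on your GPU, the quantized pipeline may fit with plain model offload instead of sequential offload, which is usually much faster.

import torch
from diffusers import FluxPipeline
from optimum.quanto import freeze, qfloat8, quantize  # pip install optimum-quanto

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16
)

# Quantize the transformer weights to FP8, then freeze to replace the
# original bfloat16 weights with the quantized ones.
quantize(pipe.transformer, weights=qfloat8)
freeze(pipe.transformer)

# Optionally quantize the T5 text encoder as well for extra memory savings.
quantize(pipe.text_encoder_2, weights=qfloat8)
freeze(pipe.text_encoder_2)

pipe.enable_model_cpu_offload()  # or pipe.to("cuda") if everything fits in VRAM

out = pipe(
    prompt="a photo of a cat wearing a spacesuit",  # placeholder prompt
    guidance_scale=0.0,
    height=1024,
    width=1024,
    num_inference_steps=4,
    max_sequence_length=256,
).images[0]
out.save("image_fp8.png")

This is only one way to get FP8 weights; the exact speedup depends on your hardware, and you should compare a few outputs against the bfloat16 baseline to confirm the quality trade-off is acceptable for your use case.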