Chroma is an 8.9 billion parameter rectified flow transformer capable of generating images from text descriptions. It is based on FLUX.1 [schnell] with heavy architectural modifications.
These files are quantized into GGUF format using a modified llama.cpp and city96's ComfyUI-GGUF/tools. The distillation layers are not quantized.
Also see silveroxides' Chroma GGUFs! (BF16, Q8_0, Q6_K, Q5_K_S, Q5_1, Q5_0, Q4_K_M, Q4_1, Q4_0, Q3_K_L)
Q*_M GGUFs are mixed quantizations that aim to maximize speed by selectively choosing the quantization type of certain layers.
- Q8_M uses Q8_0 quantization for most weights for performance, mixed with Q6_K on the less heavy layers.
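
The idea of a mixed quantization can be illustrated with a minimal Python sketch. This is not the script used to produce these files: the size threshold, the `distilled_guidance_layer.` prefix, the example tensor names and shapes, and the choice to leave 1-D tensors untouched are all assumptions made purely for illustration.

```python
from math import prod

# Hypothetical values, chosen only for this illustration.
HEAVY_PARAM_THRESHOLD = 1_000_000                     # "heavy" = at least this many weights
UNQUANTIZED_PREFIXES = ("distilled_guidance_layer.",)  # assumed name of the distillation layers


def choose_quant_type(name, shape):
    """Pick a per-tensor treatment in the spirit of the Q8_M mix described above."""
    # Distillation layers are left unquantized.
    if name.startswith(UNQUANTIZED_PREFIXES):
        return "KEEP"
    # Assumption: 1-D tensors (norm scales, biases) are also left as-is.
    if len(shape) < 2:
        return "KEEP"
    # Heavy weight matrices get Q8_0, lighter ones get Q6_K.
    if prod(shape) >= HEAVY_PARAM_THRESHOLD:
        return "Q8_0"
    return "Q6_K"


# Hypothetical tensor names and shapes, not read from a real checkpoint.
example_tensors = {
    "distilled_guidance_layer.in_proj.weight": (5120, 64),
    "double_blocks.0.img_attn.qkv.weight": (9216, 3072),
    "final_layer.linear.weight": (64, 3072),
    "double_blocks.0.img_attn.norm.query_norm.scale": (128,),
}

plan = {name: choose_quant_type(name, shape) for name, shape in example_tensors.items()}

for name, quant in plan.items():
    print(f"{quant:5s} {name}")
```

A real mix would be tuned per layer by measuring speed and output quality rather than by a single size threshold.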
Model tree for Clybius/Chroma-GGUF: base model lodestones/Chroma