lodestones/Chroma · 12B → 8.9B code?

Mar 31

re:

TL;DR: There are 3.3B parameters that only encode a single input vector, which I replaced with 250M params.

Since FLUX is so big, I had to modify the architecture and ensure minimal knowledge was lost in the process. The most obvious thing to prune was this modulation layer. In the diagram, it may look small, but in total, FLUX has 3.3B parameters allocated to it. Without glazing over the details too much, this layer's job is to let the model know which timestep it's at during the denoising process. This layer also receives information from pooled CLIP vectors.

is the code for doing this available anywhere?

Wi-zz

Apr 4

Is it not this?
https://github.com/lodestone-rock/ComfyUI_FluxMod

lodestones

Owner May 16

it's super messy and i did it on jupyter notebook
i'll clean that up soon

GuardSkill

20 days ago

Is it not this?
https://github.com/lodestone-rock/ComfyUI_FluxMod

Is that code work?