Is this improving the quality of flux altogether?

#2
by omarei - opened

I noticed that there's a level of quality this lora is ameliorating in flux dev beyond just spatial. Is this something anyone else is noticing?

Hi @omarei , thanks you for the insightful comment!

We also observed that the improvements extend beyond spatial understanding, enhancing aspects like image fidelity and text-image alignment. This is something we quantitatively analyze in our paper (Table 2). We offered a preliminary conjecture for this phenomenon in Section 4.2:

We conjecture that in base models, spatial terms are often entangled with unrelated semantics due to flawed data. By disentangling these terms, CoMPaSS may also help the model better understand other aspects of language, resulting in these broader improvements.

Sign up or log in to comment