How flux.1 dev Guiandec Embeding working ?

#3
by GuardSkill - opened

Great job, but I even don't know the theory of flux.1' Guiandec Embeding.

GuardSkill changed discussion title from Why reduce the parameters from 12B to 8B, for better trainning? to How flux.1 dev Guiandec Embeding working ?

Sign up or log in to comment