An end-to-end (e2e) Voice Language Model by Fish Audio.
Image generator/identifier/reposer
Generate images quickly with SD3.5 Turbo