---
license: openrail
datasets:
- vucinatim/spectrogram-captions
language:
- en
library_name: diffusers
pipeline_tag: text-to-image
---

SDXL 1.0 fine-tuned on vucinatim/spectrogram-captions for 89 epochs (800 steps). It generates spectrograms of simple sounds. The sound effects it produces are not yet very good; I plan to train the model for longer in the future.
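
A minimal sketch of how the model might be used with `diffusers`. Several things here are assumptions rather than facts from this card: `MODEL_ID` is a hypothetical placeholder (the repository id is not stated here), the step count and guidance scale are illustrative defaults, and the code assumes the repository holds a full pipeline that `DiffusionPipeline.from_pretrained` can load directly. If the weights were instead published as a LoRA, the SDXL base model would need to be loaded first and the adapter applied on top.

```python
from typing import Any

# Hypothetical placeholder: this card does not state the repository id.
MODEL_ID = "<user>/<this-model-repo>"


def generation_kwargs(
    prompt: str, steps: int = 30, guidance_scale: float = 7.0
) -> dict[str, Any]:
    """Collect the arguments for the pipeline call (default values are illustrative)."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return {
        "prompt": prompt,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


def generate_spectrogram(prompt: str, model_id: str = MODEL_ID, device: str = "cuda"):
    """Load the pipeline and return one generated spectrogram as a PIL image."""
    # Imported lazily so the helper above works without torch/diffusers installed.
    import torch
    from diffusers import DiffusionPipeline

    # Half precision keeps SDXL within typical GPU memory; fall back to float32 on CPU.
    dtype = torch.float16 if device == "cuda" else torch.float32
    pipe = DiffusionPipeline.from_pretrained(model_id, torch_dtype=dtype)
    pipe = pipe.to(device)
    return pipe(**generation_kwargs(prompt)).images[0]
```

With a real repository id filled in, `generate_spectrogram("a dog barking").save("dog_bark.png")` would write one spectrogram image to disk; the prompt is only an example of the "simple sounds" the model targets.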