gguf quantized ace-step-v1-3.5b

  • base model from ace-step
  • full gguf set (model + encoder + vae) works right away

setup (once)

  • drag ace-step to > ./ComfyUI/models/diffusion_models
  • drag umt5-base to > ./ComfyUI/models/text_encoders
  • drag pig to > ./ComfyUI/models/vae
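
The three drag steps above can also be scripted. The following is a minimal sketch, not part of the original instructions: it assumes each downloaded file has the keyword from the list (`ace-step`, `umt5-base`, `pig`) somewhere in its filename, and the function name `place_models` and the download folder are hypothetical.

```python
import shutil
from pathlib import Path

# Keyword -> target folder, taken from the setup list above.
# Matching files by a keyword in the filename is an assumption of this sketch.
TARGETS = {
    "ace-step": "models/diffusion_models",
    "umt5-base": "models/text_encoders",
    "pig": "models/vae",
}


def place_models(download_dir, comfy_root, targets=TARGETS):
    """Move each file whose name contains a keyword into its ComfyUI folder."""
    moved = []
    for path in sorted(Path(download_dir).iterdir()):
        if not path.is_file():
            continue
        for keyword, subdir in targets.items():
            if keyword in path.name:
                dest_dir = Path(comfy_root) / subdir
                dest_dir.mkdir(parents=True, exist_ok=True)
                shutil.move(str(path), str(dest_dir / path.name))
                moved.append((path.name, subdir))
                break  # first matching keyword wins
    return moved
```

Calling `place_models("./downloads", "./ComfyUI")` returns a list of `(filename, folder)` pairs for the files it moved; files that match no keyword are left where they are.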

screenshot

workflow

  • drag the json or the demo audio below into the browser to load the workflow
  • prompt:
      female singing pop music electronic beats fennec core
      cute fennec girl
      massive fennec ears
      big fluffy tail
      long blonde wavy hair
      large blue eyes
      I love fennec girl
  • audio sample: 🎧 ace-step

review

  • note: some key tensors have to stay in f32 for the model to work, so the file size might not shrink that much; it still generally loads faster than the safetensors checkpoint (no last-minute bottleneck problem)
  • the umt5-base tokenizer logic has been rebuilt successfully (similar to umt5-xxl; credit goes to city96 and all the other contributors who worked on solving that issue); upgrade your node to the latest version for umt5-base encoder support; the safetensors checkpoint is therefore no longer needed (removed here; if you still want it, you can get it from comfyui-org)
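
To see why keeping some tensors in f32 limits the size reduction, here is a rough back-of-the-envelope sketch. The function `estimated_size_gb` and the example fraction are hypothetical; the real share of f32 tensors is not stated here, and the estimate ignores metadata and per-block quantization overhead.

```python
def estimated_size_gb(total_params, f32_fraction, quant_bits):
    """Rough file size in GB when a share of the weights stays in f32.

    f32_fraction is the (assumed) share of parameters kept at 32 bits;
    the rest is stored at quant_bits bits per weight.
    """
    f32_bits = total_params * f32_fraction * 32
    quantized_bits = total_params * (1 - f32_fraction) * quant_bits
    return (f32_bits + quantized_bits) / 8 / 1e9


# Illustrative numbers only: 3.5e9 parameters (nominal, from the model name),
# with a hypothetical 10% of them kept in f32 and the rest at 4 bits.
mixed = estimated_size_gb(3.5e9, 0.10, 4)
all_4bit = estimated_size_gb(3.5e9, 0.0, 4)
print(f"mixed: {mixed:.2f} GB, all 4-bit: {all_4bit:.2f} GB")
```

Even a small f32 share adds up quickly, because each f32 weight costs as much as eight 4-bit weights.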

bonus: fp8/16/32 scaled stable-audio-open-1.0 with gguf quantized t5_base encoder

  • base model from stabilityai
  • note: this is a different model, so don't mix the two up; it is also powerful and lightweight
  • dry running

setup (once)

  • drag t5-base to > ./ComfyUI/models/text_encoders
  • drag safetensors to > ./ComfyUI/models/checkpoints
  • drag pig to > ./ComfyUI/models/vae
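
Before loading the workflow, it can help to confirm that each folder from the list above holds a matching file. This is a small sketch under the assumption that filenames contain the keywords `t5-base`, `safetensors`, and `pig`; the helper name `missing_models` is hypothetical.

```python
from pathlib import Path

# Keyword -> folder, taken from the setup list above.
# Note: the keyword "t5-base" also matches "umt5-base" filenames.
EXPECTED = {
    "t5-base": "models/text_encoders",
    "safetensors": "models/checkpoints",
    "pig": "models/vae",
}


def missing_models(comfy_root, expected=EXPECTED):
    """Return (keyword, folder) pairs for which no matching file was found."""
    missing = []
    for keyword, subdir in expected.items():
        folder = Path(comfy_root) / subdir
        found = folder.is_dir() and any(
            keyword in p.name for p in folder.iterdir() if p.is_file()
        )
        if not found:
            missing.append((keyword, subdir))
    return missing
```

An empty list from `missing_models("./ComfyUI")` means all three folders contain a file with the expected keyword.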

screenshot

  • prompt: heaven church electronic dance music
  • audio sample: 🎧 stable-audio

reference
