dia-gguf / README.md
metadata
license: apache-2.0
language:
  - en
base_model:
  - nari-labs/Dia-1.6B
pipeline_tag: text-to-speech
tags:
  - gguf-connector

gguf quantized and fp8/16/32 scaled dia-1.6b

  • base model from nari-labs
  • text-to-speech synthesis

run it with gguf-connector

ggc s2

screenshot

Prompt | Audio Sample

[S1] This is an open weights text to dialogue model.
[S2] You get full control over scripts and voices.
[S1] Wow. Amazing. (laughs)
[S2] Try it now on Git hub or Hugging Face.
🎧 dia-sample-1

[S1] Hey Connector, why your appearance looks so stupid?
[S2] Oh, really? maybe I ate too much smart beans.
[S1] Wow. Amazing. (laughs)
[S2] Let's go to get some more smart beans and you will become stupid as well.
🎧 dia-sample-2
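
The sample prompts above mark each speaker turn with an [S1] or [S2] tag. As a minimal sketch, assuming turns simply alternate between two speakers, a script can be tagged like this. The helper name `build_dialogue_prompt` is hypothetical and not part of gguf-connector; it only produces text in the same shape as the samples.

```python
from typing import Iterable


def build_dialogue_prompt(lines: Iterable[str], speakers: int = 2) -> str:
    """Prefix each line with a rotating speaker tag: [S1], [S2], [S1], ...

    Hypothetical helper: the tag format is copied from the sample prompts,
    and strict alternation between speakers is an assumption.
    """
    tagged = []
    for i, text in enumerate(lines):
        tag = f"[S{i % speakers + 1}]"
        tagged.append(f"{tag} {text.strip()}")
    return "\n".join(tagged)


if __name__ == "__main__":
    script = [
        "This is an open weights text to dialogue model.",
        "You get full control over scripts and voices.",
        "Wow. Amazing. (laughs)",
    ]
    # Prints one tagged line per turn, ready to paste in as a prompt.
    print(build_dialogue_prompt(script))
```

Non-verbal cues such as (laughs) are kept inline, as in the samples.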

review/reference

  • simply execute the command (ggc s2) above in console/terminal
  • note: the model file(s) are pulled to the local cache automatically on first launch; after that, you can run it entirely offline from the local URL http://127.0.0.1:7860 with the lazy webui
  • gguf-connector (pypi)
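
To confirm the web UI is up after launching, a small check like the one below may help. It is a sketch using only the Python standard library and does not call any gguf-connector API; the only value taken from the notes above is the local URL http://127.0.0.1:7860, and the function name `webui_is_listening` is hypothetical.

```python
import socket
from urllib.parse import urlparse

# Local URL taken from the note above.
WEBUI_URL = "http://127.0.0.1:7860"


def webui_is_listening(url: str = WEBUI_URL, timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections at the URL's host/port.

    This only checks that the port is open; it does not verify that the
    listener is actually the web UI.
    """
    parsed = urlparse(url)
    host = parsed.hostname or "127.0.0.1"
    port = parsed.port or 80
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False


if __name__ == "__main__":
    state = "reachable" if webui_is_listening() else "not reachable"
    print(f"{WEBUI_URL} is {state}")
```

Because the check targets 127.0.0.1, it works without network access, which fits the offline use described above.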