YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

GLM-4-Voice-9B

GLM-4-Voice 是智谱 AI 推出的端到端语音模型。GLM-4-Voice 能够直接理解和生成中英文语音,进行实时语音对话,并且能够根据用户的指令改变语音的情感、语调、语速、方言等属性。

GLM-4-Voice is an end-to-end voice model launched by Zhipu AI. GLM-4-Voice can directly understand and generate Chinese and English speech, engage in real-time voice conversations, and change attributes such as emotion, intonation, speech rate, and dialect based on user instructions.

本仓库是 GLM-4-Voice 的 LLM 部分。GLM-4-Voice-9B 在 GLM-4-9B 的基础上进行语音模态的预训练和对齐,从而能够理解和生成离散化的语音。

The repo provides the LLM part of GLM-4-Voice, pre-trained and aligned on speech modality based on GLM-4-9B, enabling understanding and generation of discretized speech.

更多信息请参考我们的仓库 GLM-4-Voice.

For more information please refer to our repo GLM-4-Voice.

Downloads last month
4,749
Safetensors
Model size
9.54B params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for THUDM/glm-4-voice-9b

Quantizations
2 models

Spaces using THUDM/glm-4-voice-9b 5

Collection including THUDM/glm-4-voice-9b