Multimodal - MLX
Collection
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
β’
9 items
β’
Updated
β’
2
Run them directly with nexa-sdk installed In nexa-sdk CLI:
NexaAI/Kokoro-82M-bf16-MLX
Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient. With Apache-licensed weights, Kokoro can be deployed anywhere from production environments to personal projects.
π GitHub: https://github.com/hexgrad/kokoro
π Demo: https://hf.co/spaces/hexgrad/Kokoro-TTS
Original model card: hexgrad/Kokoro-82M
Base model
yl4579/StyleTTS2-LJSpeech