Ji-Xiang's Collections
Voice
Updated Oct 25, 2024
myshell-ai/OpenVoiceV2 • Text-to-Speech • Updated Dec 24, 2024 • 439
F5-TTS 🗣 • F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo) • Running on Zero • 2.58k
SWivid/F5-TTS • Text-to-Speech • Updated Mar 21 • 894k • 1.1k
zai-org/glm-4-voice-9b • 10B • Updated Oct 25, 2024 • 14.1k • 106
zai-org/glm-4-voice-decoder • Updated Oct 25, 2024 • 229 • 16
zai-org/glm-4-voice-tokenizer • 0.4B • Updated Oct 25, 2024 • 58.6k • 10