
🧧 February 2025 - Open releases from the Chinese community
Image-Text-to-Text • Updated • 842 • 148Note MLLM - Apache 2.0 - 1/2/4/6/18/34B https://huggingface.co/collections/AIDC-AI/ovis2-67ab36c7e497429034874464
FunAudioLLM/InspireMusic-Base
Updated • 29 • 12Note Audio/Music/Song - Apache2.0 - 0.5/1.5B https://huggingface.co/FunAudioLLM
stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text • Updated • 189 • 440Note MLLM - Apache 2.0 https://huggingface.co/collections/stepfun-ai/step-audio-67b33accf45735bb21131b0b
stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech • Updated • 236 • 184Note TTS - Apache2.0 By StepFun
stepfun-ai/stepvideo-t2v
Text-to-Video • Updated • 347 • 455Note Video - MIT - 30B
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
Viewer • Updated • 110k • 1.92k • 662Note Powerful Chinese open-source distillation dataset (R1) with 110K samples
Skywork/SkyReels-A1
Image-to-Video • Updated • 231 • 59Skywork/SkyReels-V1-Hunyuan-I2V
Image-to-Video • Updated • 833 • 269
moonshotai/Moonlight-16B-A3B-Instruct
Text Generation • Updated • 1.76k • 158Note First open MoE by Moonshot AI
ByteDance-Seed/BFS-Prover
Text Generation • Updated • 3.87k • 17
Wan-AI/Wan2.1-T2V-14B
Text-to-Video • Updated • 38.9k • • 1.27kNote Video - Apache2.0 - 1.3/14B https://huggingface.co/Wan-AI
qihoo360/TinyR1-32B-Preview
Text Generation • Updated • 4.74k • • 327Note Reasoning model - Apache2.0 - Outperforms the 70B model Deepseek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.
baichuan-inc/Baichuan-Audio-Base
Updated • 100 • 11
GSAI-ML/LLaDA-8B-Instruct
Text Generation • Updated • 85.2k • 279Note Fully trained from scratch, LLaDA delivers performance on par with LLaMA3 8B
- 473
Chat with DeepSeek-VL2-small
🌍Generate responses using images and text input
- 206
LLM训练终极指南 | The Ultra-Scale Playbook
🔥了解LLM训练的方方面面
- 9
AttentiveEraser - Object Remover
🚀Unleashing Diffusion Model’s Object Removal Potential
- 1.62k
Wan2.1
💻Wan: Open and Advanced Large-Scale Video Generative Models