-
-
-
-
-
-
Inference Providers
Active filters:
sglang
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
•
14B
•
Updated
•
84.1k
•
4
Doradus-AI/MiroThinker-v1.0-30B-FP8
Text Generation
•
31B
•
Updated
•
11
•
4
Image-Text-to-Text
•
138B
•
Updated
•
476
•
1
SurfaceData/llava-v1.6-mistral-7b-sglang
Image-Text-to-Text
•
8B
•
Updated
•
5
•
9
SurfaceData/llava-v1.6-vicuna-7b-sglang
Image-Text-to-Text
•
7B
•
Updated
•
3
•
1
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
•
73B
•
Updated
•
24
•
2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
•
69B
•
Updated
•
28
alvarobartt/grok-2-tokenizer
Text Generation
•
Updated
•
9
•
3
VibeStudio/MiniMax-M2-THRIFT
173B
•
Updated
•
1.58k
•
35
mradermacher/MiniMax-M2-THRIFT-GGUF
JasmineBBB/Kimi-Linear-48B-A3B-Instruct-bnb-4bit
Text Generation
•
49B
•
Updated
•
7
•
1
mradermacher/MiniMax-M2-THRIFT-i1-GGUF
173B
•
Updated
•
216
•
10
bartowski/VibeStudio_MiniMax-M2-THRIFT-GGUF
Text Generation
•
173B
•
Updated
•
267
•
8
VibeStudio/MiniMax-M2-THRIFT-55
106B
•
Updated
•
137
•
5
JinnP/SGLang-EAGLE3-Qwen3-Coder-30B-A3B-Instruct
Text Generation
•
0.2B
•
Updated
•
189
•
1
mradermacher/MiniMax-M2-THRIFT-55-GGUF
106B
•
Updated
•
25
•
2
mradermacher/MiniMax-M2-THRIFT-55-i1-GGUF
106B
•
Updated
•
294
•
2
VibeStudio/MiniMax-M2-THRIFT-55-MLX-4bit
106B
•
Updated
•
109
•
2
VibeStudio/MiniMax-M2-THRIFT-55-MLX-6bit
106B
•
Updated
•
97
Doradus-AI/Hermes-4.3-36B-FP8
Text Generation
•
36B
•
Updated
•
89
•
2
Doradus-AI/RnJ-1-Instruct-FP8
Text Generation
•
9B
•
Updated
•
3
•
4
QuantTrio/Qwen3-Coder-Next-E336
Text Generation
•
53B
•
Updated
•
76
QuantTrio/Qwen3-Coder-Next-E400
Text Generation
•
63B
•
Updated
•
1.18k