-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
Text Generation
•
22B
•
Updated
•
5.96M
•
•
4.32k
Text Generation
•
120B
•
Updated
•
3.25M
•
•
4.46k
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Text Generation
•
18B
•
Updated
•
104k
•
82
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
0.8B
•
Updated
•
6.28k
•
1.29k
mlx-community/Qwen3-Coder-Next-8bit
Text Generation
•
80B
•
Updated
•
1.07k
•
8
GadflyII/GLM-4.7-Flash-NVFP4
Text Generation
•
18B
•
Updated
•
280k
•
59
MultiverseComputingCAI/HyperNova-60B
Text Generation
•
60B
•
Updated
•
1.25k
•
52
MaziyarPanahi/Qwen3-14B-GGUF
Text Generation
•
15B
•
Updated
•
283k
•
8
openai/gpt-oss-safeguard-20b
Text Generation
•
22B
•
Updated
•
41.1k
•
•
190
GadflyII/GLM-4.7-Flash-MXFP4
Text Generation
•
18B
•
Updated
•
10.4k
•
8
unsloth/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Text Generation
•
18B
•
Updated
•
321
•
8
inferencerlabs/Qwen3-Coder-Next-MLX-9bit
Text Generation
•
80B
•
Updated
•
984
•
3
MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF
Text Generation
•
12B
•
Updated
•
193k
•
52
nvidia/Llama-3.3-70B-Instruct-NVFP4
41B
•
Updated
•
10.7k
•
34
nvidia/DeepSeek-R1-0528-NVFP4-v2
Text Generation
•
394B
•
Updated
•
102k
•
13
Text Generation
•
22B
•
Updated
•
30.6k
•
41
openai/gpt-oss-safeguard-120b
Text Generation
•
120B
•
Updated
•
28.4k
•
85
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-NVFP4-QAD
Image-Text-to-Text
•
8B
•
Updated
•
36.1k
•
19
kldzj/gpt-oss-120b-heretic-v2
Text Generation
•
117B
•
Updated
•
398
•
20
mlx-community/mistralai_Devstral-Small-2-24B-Instruct-2512-MLX-8Bit
Text Generation
•
24B
•
Updated
•
782
•
6
Text Generation
•
177B
•
Updated
•
4.07k
•
14
lukealonso/MiniMax-M2.1-NVFP4
115B
•
Updated
•
27.8k
•
23
mlx-community/Qwen3-ASR-1.7B-8bit
0.8B
•
Updated
•
1.07k
•
7
CalamitousFelicitousness/HunyuanImage-3.0-Instruct-Distil-SDNQ-4bit-dynamic
Image-to-Image
•
45B
•
Updated
•
73
•
2
mlx-community/GLM-OCR-8bit
Image-to-Text
•
0.6B
•
Updated
•
699
•
2
EpistemeAI/rsi-gpt-oss-120bv2-8bit
Text Generation
•
120B
•
Updated
•
134
•
2
MuXodious/gpt-oss-20b-RichardErkhov-heresy
Text Generation
•
22B
•
Updated
•
72
•
2
StefanKrsteski/Phi-3-mini-4k-instruct-GPTQ-8bit
Text Generation
•
4B
•
Updated
•
27
•
2
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int8
Text Generation
•
2B
•
Updated
•
381
•
6
brunopio/Llama3-8B-1.58-100B-tokens-GGUF
Text Generation
•
3B
•
Updated
•
966
•
20