-
-
-
-
-
-
Inference Providers
Active filters:
awq
LGAI-EXAONE/EXAONE-4.0-32B-AWQ
Text Generation
•
5B
•
Updated
•
8
•
9
LGAI-EXAONE/EXAONE-4.0-1.2B-AWQ
Text Generation
•
0.4B
•
Updated
•
3
•
7
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
400k
•
82
Qwen/Qwen3-4B-AWQ
Text Generation
•
0.9B
•
Updated
•
15.8k
•
11
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ
Text Generation
•
4B
•
Updated
•
3.18k
•
23
cashlion/OpenCodeReasoning-Nemotron-1.1-32B-AWQ
Text Generation
•
33B
•
Updated
•
82
•
3
TheBloke/Mistral-7B-Instruct-v0.2-AWQ
Text Generation
•
1B
•
Updated
•
33.1k
•
48
casperhansen/llama-3-70b-instruct-awq
Text Generation
•
11B
•
Updated
•
3.14k
•
69
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
2B
•
Updated
•
253k
•
71
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation
•
11B
•
Updated
•
328k
•
104
Qwen/Qwen2.5-7B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
14.3k
•
26
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation
•
6B
•
Updated
•
150k
•
82
Qwen/Qwen2.5-72B-Instruct-AWQ
Text Generation
•
12B
•
Updated
•
138k
•
71
LGAI-EXAONE/EXAONE-3.5-2.4B-Instruct-AWQ
Text Generation
•
0.8B
•
Updated
•
1.1k
•
20
RichardErkhov/liminerity_-_Bitnet-Mistral.0.2-330m-v0.2-grokfast-awq
0.1B
•
Updated
•
1
•
1
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
Image-Text-to-Text
•
1B
•
Updated
•
20.9k
•
44
Qwen/Qwen3-8B-AWQ
Text Generation
•
2B
•
Updated
•
105k
•
12
AngelSlim/Qwen3-14b_int4_awq
3B
•
Updated
•
42
•
1
stelterlab/NextCoder-32B-AWQ
Text Generation
•
6B
•
Updated
•
46
•
3
cpatonn/OpenCodeReasoning-Nemotron-1.1-32B-AWQ
Text Generation
•
6B
•
Updated
•
6
•
1
TheBloke/Unholy-v1-10l-13B-AWQ
Text Generation
•
2B
•
Updated
•
6
•
4
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
9
•
3
casperhansen/falcon-7b-awq
Text Generation
•
Updated
•
15
•
1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
•
Updated
•
6
•
3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
•
Updated
•
8
•
1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation
•
Updated
•
17
casperhansen/opt-125m-awq
Text Generation
•
0.1B
•
Updated
•
1.2k
•
3
casperhansen/tinyllama-1b-awq
Text Generation
•
Updated
•
3.45k
Bomml/Llama-2-70B-chat-w4-g128-awq
Text Generation
•
Updated
TheBloke/Llama-2-7B-Chat-AWQ
Text Generation
•
1B
•
Updated
•
7.23k
•
21