-
-
-
-
-
-
Inference Providers
Active filters:
nm-vllm
RedHatAI/TinyLlama-1.1B-Chat-v1.0-pruned2.4
Text Generation
•
Updated
•
27
•
1
RedHatAI/MiniChat-2-3B-pruned2.4
Text Generation
•
Updated
•
13
RedHatAI/OpenHermes-2.5-Mistral-7B-pruned2.4
Text Generation
•
Updated
•
17
RedHatAI/OpenHermes-2.5-Mistral-7B-pruned50
Text Generation
•
Updated
•
17
•
1
RedHatAI/Nous-Hermes-2-SOLAR-10.7B-pruned2.4
Text Generation
•
Updated
•
19
RedHatAI/Nous-Hermes-2-Yi-34B-pruned2.4
Text Generation
•
Updated
•
12
RedHatAI/Nous-Hermes-2-Yi-34B-pruned50
Text Generation
•
Updated
•
15
RedHatAI/zephyr-7b-beta-marlin
Text Generation
•
1B
•
Updated
•
211
RedHatAI/llama2.c-stories110M-pruned2.4
Text Generation
•
Updated
•
18
RedHatAI/llama2.c-stories110M-pruned50
Text Generation
•
Updated
•
2.78k
RedHatAI/phi-2-pruned50
Text Generation
•
3B
•
Updated
•
14
RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
•
0.3B
•
Updated
•
5.23k
•
1
RedHatAI/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
•
1B
•
Updated
•
54
•
2
RedHatAI/Nous-Hermes-2-Yi-34B-marlin
Text Generation
•
5B
•
Updated
•
28
•
5
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
•
10B
•
Updated
•
8
softmax/falcon-180B-chat-marlin
Text Generation
•
26B
•
Updated
•
10
dtransposed/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
0.1B
•
Updated
•
9
nm-testing/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
0.1B
•
Updated
•
17
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-GGUF
11B
•
Updated
•
144
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-i1-GGUF
11B
•
Updated
•
601
tensorblock/llama2.c-stories110M-pruned50-GGUF
0.1B
•
Updated
•
122
mradermacher/phi-2-pruned50-GGUF
3B
•
Updated
•
160
mradermacher/llama2.c-stories110M-pruned50-GGUF
0.1B
•
Updated
•
87
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
7B
•
Updated
•
85
•
1
mradermacher/MiniChat-2-3B-pruned2.4-GGUF
3B
•
Updated
•
115
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF
7B
•
Updated
•
173
mradermacher/llama2.c-stories110M-pruned50-i1-GGUF
0.1B
•
Updated
•
189
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
7B
•
Updated
•
124
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF
7B
•
Updated
•
270
tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
7B
•
Updated
•
127