-
-
-
-
-
-
Inference Providers
Active filters:
sparse
RedHatAI/Llama-2-7b-dolphin-open_platypus-pruned_50-quantized-deepsparse
Text Generation
•
Updated
•
15
RedHatAI/Llama-2-7b-dolphin-open_platypus-pruned_70-quantized-deepsparse
Text Generation
•
Updated
•
16
•
1
kettleguts/zephyr-7b-beta_sparse05
Text Generation
•
7B
•
Updated
•
8
dtransposed/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
0.1B
•
Updated
•
9
nm-testing/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
0.1B
•
Updated
•
17
RedHatAI/Llama-2-7b-gsm8k-pruned_50
Text Generation
•
7B
•
Updated
•
28
•
1
RedHatAI/Llama-2-7b-gsm8k-pruned_70
Text Generation
•
7B
•
Updated
•
19
mradermacher/Llama-2-7b-pruned70-retrained-gsm8k-GGUF
7B
•
Updated
•
510
RedHatAI/SparseLlama-3-8B-pruned_50.2of4
Text Generation
•
8B
•
Updated
•
45
nm-testing/SparseLlama-3-8B-pruned_50.2of4-FP8
Text Generation
•
8B
•
Updated
•
9
vuiseng9/ov-mpt-7b-gsm8k-sparse70
Text Generation
•
Updated
opensearch-project/opensearch-neural-sparse-encoding-v2-distill
Feature Extraction
•
0.1B
•
Updated
•
64.9k
•
6
opensearch-project/opensearch-neural-sparse-encoding-doc-v2-mini
Feature Extraction
•
0.0B
•
Updated
•
77
•
3
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-GGUF
11B
•
Updated
•
143
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-i1-GGUF
11B
•
Updated
•
595
tensorblock/llama2.c-stories110M-pruned50-GGUF
0.1B
•
Updated
•
122
tensorblock/Llama-2-7b-pruned50-retrained-GGUF
Text Generation
•
7B
•
Updated
•
49
mradermacher/phi-2-pruned50-GGUF
3B
•
Updated
•
160
mradermacher/llama2.c-stories110M-pruned50-GGUF
0.1B
•
Updated
•
87
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
7B
•
Updated
•
85
•
1
mradermacher/MiniChat-2-3B-pruned2.4-GGUF
3B
•
Updated
•
115
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF
7B
•
Updated
•
173
mradermacher/llama2.c-stories110M-pruned50-i1-GGUF
0.1B
•
Updated
•
189
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
7B
•
Updated
•
123
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF
7B
•
Updated
•
267
tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
7B
•
Updated
•
127
tensorblock/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
7B
•
Updated
•
129
mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_70-GGUF
7B
•
Updated
•
152
mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_50-GGUF
7B
•
Updated
•
159
mradermacher/Nous-Hermes-2-Yi-34B-pruned2.4-GGUF
34B
•
Updated
•
179