Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

239

Full-text search

Active filters: sparse

tensorblock/llama2.c-stories110M-pruned50-GGUF

0.1B • Updated Jul 9 • 130

tensorblock/Llama-2-7b-pruned50-retrained-GGUF

Text Generation • 7B • Updated Jul 9 • 96

mradermacher/phi-2-pruned50-GGUF

3B • Updated Aug 1 • 167

mradermacher/llama2.c-stories110M-pruned50-GGUF

0.1B • Updated Apr 10 • 105

mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF

7B • Updated Apr 10 • 77 • 1

mradermacher/MiniChat-2-3B-pruned2.4-GGUF

3B • Updated Apr 10 • 98

mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF

7B • Updated Apr 10 • 177

mradermacher/llama2.c-stories110M-pruned50-i1-GGUF

0.1B • Updated Apr 10 • 262

mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF

7B • Updated Apr 10 • 100

mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF

7B • Updated Apr 10 • 203

tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF

7B • Updated Jul 9 • 84

tensorblock/OpenHermes-2.5-Mistral-7B-pruned50-GGUF

7B • Updated Jul 9 • 141

mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_70-GGUF

7B • Updated Apr 10 • 142

mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_50-GGUF

7B • Updated Apr 10 • 148

mradermacher/Nous-Hermes-2-Yi-34B-pruned2.4-GGUF

34B • Updated Jul 31 • 109

mradermacher/Nous-Hermes-2-Yi-34B-pruned50-GGUF

34B • Updated Jul 31 • 154

mradermacher/opensearch-neural-sparse-encoding-doc-v2-mini-GGUF

22.6M • Updated Jul 31 • 334

mradermacher/SparseLlama-3-8B-pruned_50.2of4-GGUF

8B • Updated Jul 11 • 148 • 1

opensearch-project/opensearch-neural-sparse-encoding-doc-v3-distill

Feature Extraction • 67M • Updated Jun 30 • 1.87k • • 8

tjingrant/sparsellm-1b-40p

1B • Updated Apr 15 • 2

tjingrant/sparsellm-1b-60p-small-dense

0.7B • Updated Apr 15 • 3

tjingrant/sparsellm-1b-80p

1B • Updated Apr 15 • 2

tjingrant/sparsellm-1b-60p

1B • Updated Apr 15 • 2

tjingrant/sparsellm-1b-20p

1B • Updated Apr 15 • 3

tjingrant/sparsellm-1b-80p-small-dense

0.5B • Updated Apr 15 • 7

tjingrant/sparsellm-1b-40p-small-dense

0.9B • Updated Apr 15 • 3

tjingrant/sparsellm-1b-20p-small-dense

1B • Updated Apr 15 • 3

tensorblock/RedHatAI_llama2.c-stories110M-pruned50-GGUF

0.1B • Updated Jul 9 • 112

sparse-encoder-testing/splade-bert-tiny-nq

Feature Extraction • 4.42M • Updated May 15 • 8.57k

tomaarsen/inference-free-splade-bert-tiny-nq-3e-3-lambda-corpus

Feature Extraction • Updated May 16 • 5