Edit Models filters

Inference Providers

HF Inference API

Misc

NeelNanda/pile-10k

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

106

Full-text search

Active filters: NeelNanda/pile-10k

Intel/GLM-4.5V-int4-AutoRound

2B • Updated 11 days ago • 114 • 6

Intel/phi-2-int4-inc

Text Generation • 0.6B • Updated Oct 22, 2024 • 5 • 3

Intel/gemma-2b-int4-inc

Text Generation • 1B • Updated Aug 26, 2024 • 15 • 1

Intel/falcon-7b-sq-int8-inc

Text Generation • Updated Apr 17, 2024 • 8

Intel/Phi-3-mini-4k-instruct-int4-inc

Updated Jul 4, 2024 • 4

Intel/Baichuan2-13B-Chat-int4-inc

Updated Jul 4, 2024 • 1

Intel/SOLAR-10.7B-Instruct-v1.0-int4-inc

Updated Jul 4, 2024 • 1

Intel/opt-1.3b-int4-inc-recipe

Updated Nov 6, 2024 • 1

Intel/Phi-3-mini-128k-instruct-int4-inc-recipe

Updated Nov 8, 2024 • 1

Intel/Mistral-7B-v0.1-int4-inc-lmhead

Text Generation • 1B • Updated May 29, 2024 • 7 • 1

Fizzarolli/phi3-4x4b-v1

Text Generation • 11B • Updated Jun 4, 2024 • 5 • 1

bartowski/phi3-4x4b-v1-GGUF

Text Generation • 11B • Updated Jun 3, 2024 • 72

Intel/Qwen2-0.5B-Instuct-int4-inc

Text Generation • 0.3B • Updated Jun 6, 2024 • 4

Intel/Qwen2-1.5B-Instuct-int4-inc

Text Generation • 0.7B • Updated Jun 6, 2024 • 4 • 2

Intel/Qwen2-7B-int4-inc

Text Generation • 2B • Updated Oct 24, 2024 • 5 • 6

Intel/Meta-Llama-3.1-8B-Instruct-int4-inc

Updated Nov 28, 2024 • 2

Intel/Qwen2.5-0.5B-Instruct-int4-inc

Updated Oct 10, 2024 • 1

Intel/Qwen2.5-1.5B-Instruct-int4-inc

Updated Oct 10, 2024 • 1

mradermacher/phi3-4x4b-v1-GGUF

11B • Updated Nov 15, 2024 • 58

mradermacher/phi3-4x4b-v1-i1-GGUF

11B • Updated Nov 15, 2024 • 161

OPEA/Meta-Llama-3.1-70B-Instruct-int4-asym-inc

11B • Updated Apr 30 • 8 • 1

OPEA/Qwen2.5-32B-Instruct-int4-sym-mixed-inc

6B • Updated Apr 30 • 7 • 1

OPEA/Qwen2.5-14B-Instruct-int4-sym-inc

3B • Updated Apr 30 • 7

OPEA/Meta-Llama-3.1-8B-Instruct-int4-sym-inc

2B • Updated Jun 5 • 13

OPEA/Qwen2-VL-7B-Instruct-int4-sym-inc

3B • Updated Jun 5 • 171 • 1

OPEA/Phi-3.5-vision-instruct-int4-sym-inc

Updated Apr 30 • 28

OPEA/Qwen2.5-7B-Instruct-int4-sym-inc

2B • Updated Apr 30 • 8 • 1

OPEA/Llama-3.2-11B-Vision-Instruct-int4-sym-inc

3B • Updated Jun 5 • 27 • 2

OPEA/llava-v1.5-7b-int4-sym-inc

1B • Updated Jul 18 • 17 • 1

OPEA/cogvlm2-llama3-chat-19B-int4-sym-inc

7B • Updated Jul 18 • 4