Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Docker Model Runner
Lemonade
Inference Providers
Select all
Fireworks
Cerebras
Novita
Nebius AI
Together AI
Groq
fal
Cohere
Nscale
Hyperbolic
Featherless AI
SambaNova
Replicate
HF Inference API
Misc
Reset Misc
NeelNanda/pile-10k
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
106
Full-text search
Edit filters
Sort: Trending
Active filters:
NeelNanda/pile-10k
Clear all
Intel/GLM-4.5V-int4-AutoRound
2B
•
Updated
11 days ago
•
114
•
6
Intel/phi-2-int4-inc
Text Generation
•
0.6B
•
Updated
Oct 22, 2024
•
5
•
3
Intel/gemma-2b-int4-inc
Text Generation
•
1B
•
Updated
Aug 26, 2024
•
15
•
1
Intel/falcon-7b-sq-int8-inc
Text Generation
•
Updated
Apr 17, 2024
•
8
Intel/Phi-3-mini-4k-instruct-int4-inc
Updated
Jul 4, 2024
•
4
Intel/Baichuan2-13B-Chat-int4-inc
Updated
Jul 4, 2024
•
1
Intel/SOLAR-10.7B-Instruct-v1.0-int4-inc
Updated
Jul 4, 2024
•
1
Intel/opt-1.3b-int4-inc-recipe
Updated
Nov 6, 2024
•
1
Intel/Phi-3-mini-128k-instruct-int4-inc-recipe
Updated
Nov 8, 2024
•
1
Intel/Mistral-7B-v0.1-int4-inc-lmhead
Text Generation
•
1B
•
Updated
May 29, 2024
•
7
•
1
Fizzarolli/phi3-4x4b-v1
Text Generation
•
11B
•
Updated
Jun 4, 2024
•
5
•
1
bartowski/phi3-4x4b-v1-GGUF
Text Generation
•
11B
•
Updated
Jun 3, 2024
•
72
Intel/Qwen2-0.5B-Instuct-int4-inc
Text Generation
•
0.3B
•
Updated
Jun 6, 2024
•
4
Intel/Qwen2-1.5B-Instuct-int4-inc
Text Generation
•
0.7B
•
Updated
Jun 6, 2024
•
4
•
2
Intel/Qwen2-7B-int4-inc
Text Generation
•
2B
•
Updated
Oct 24, 2024
•
5
•
6
Intel/Meta-Llama-3.1-8B-Instruct-int4-inc
Updated
Nov 28, 2024
•
2
Intel/Qwen2.5-0.5B-Instruct-int4-inc
Updated
Oct 10, 2024
•
1
Intel/Qwen2.5-1.5B-Instruct-int4-inc
Updated
Oct 10, 2024
•
1
mradermacher/phi3-4x4b-v1-GGUF
11B
•
Updated
Nov 15, 2024
•
58
mradermacher/phi3-4x4b-v1-i1-GGUF
11B
•
Updated
Nov 15, 2024
•
161
OPEA/Meta-Llama-3.1-70B-Instruct-int4-asym-inc
11B
•
Updated
Apr 30
•
8
•
1
OPEA/Qwen2.5-32B-Instruct-int4-sym-mixed-inc
6B
•
Updated
Apr 30
•
7
•
1
OPEA/Qwen2.5-14B-Instruct-int4-sym-inc
3B
•
Updated
Apr 30
•
7
OPEA/Meta-Llama-3.1-8B-Instruct-int4-sym-inc
2B
•
Updated
Jun 5
•
13
OPEA/Qwen2-VL-7B-Instruct-int4-sym-inc
3B
•
Updated
Jun 5
•
171
•
1
OPEA/Phi-3.5-vision-instruct-int4-sym-inc
Updated
Apr 30
•
28
OPEA/Qwen2.5-7B-Instruct-int4-sym-inc
2B
•
Updated
Apr 30
•
8
•
1
OPEA/Llama-3.2-11B-Vision-Instruct-int4-sym-inc
3B
•
Updated
Jun 5
•
27
•
2
OPEA/llava-v1.5-7b-int4-sym-inc
1B
•
Updated
Jul 18
•
17
•
1
OPEA/cogvlm2-llama3-chat-19B-int4-sym-inc
7B
•
Updated
Jul 18
•
4
Previous
1
2
3
4
Next