Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Together AI
fal
Replicate
SambaNova
Hyperbolic
Nscale
Nebius AI Studio
Cerebras
Cohere
Novita
Fireworks
HF Inference API
Misc
Reset Misc
vptq
Misc with no match
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
84
Full-text search
Edit filters
Sort: Trending
Active filters:
vptq
Clear all
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-65536-woft
Updated
Mar 20
•
13
VPTQ-community/Qwen2.5-7B-Instruct-v16-k65536-65536-woft
Updated
Mar 20
•
14
•
1
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-65536-woft
Updated
Mar 20
•
27
VPTQ-community/Qwen2.5-7B-Instruct-v8-k256-256-woft
Updated
Mar 20
•
23
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-0-woft
Updated
Mar 20
•
28
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft
Updated
Jan 13
•
240
•
4
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-65536-woft
Updated
Feb 25
•
4
•
1
VPTQ-community/Llama-2-13b-hf-v4-k4096-0-ft
Updated
Nov 18, 2024
•
4
VPTQ-community/Llama-2-7b-hf-v4-k4096-0-woft
Updated
Nov 18, 2024
VPTQ-community/Llama-2-13b-hf-v6-k4096-0-woft
Updated
Nov 18, 2024
VPTQ-community/Llama-2-7b-hf-v6-k4096-0-ft
Updated
Nov 18, 2024
•
2
VPTQ-community/Meta-Llama-3-70B-v12-k4096-4096-woft
Updated
Mar 20
•
8
VPTQ-community/Llama-2-70b-hf-v6-k4096-4096-woft
Updated
Nov 18, 2024
•
9
VPTQ-community/Llama-2-13b-hf-v6-k4096-4096-woft
Updated
Nov 18, 2024
VPTQ-community/Llama-2-13b-hf-v4-k4096-0-woft
Updated
Nov 18, 2024
•
6
VPTQ-community/Meta-Llama-3-8B-v6-k4096-4096-woft
Updated
Mar 20
VPTQ-community/Meta-Llama-3-70B-v12-k4096-4096-ft
Updated
Mar 20
•
6
VPTQ-community/Meta-Llama-3-8B-v4-k4096-0-woft
Updated
Mar 20
VPTQ-community/Qwen2.5-32B-Instruct-v16-k65536-65536-woft
Updated
Feb 25
•
6
•
1
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-0-woft
Updated
Feb 25
•
21
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-65536-woft
Updated
Feb 25
•
14
•
1
VPTQ-community/Qwen2.5-32B-Instruct-v8-k256-256-woft
Updated
Feb 25
•
12
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-256-woft
Updated
Feb 25
•
6
•
2
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-65536-woft
Updated
Feb 26
•
6
•
3
VPTQ-community/Llama-2-7b-hf-v6-k4096-0-woft
Updated
Nov 18, 2024
VPTQ-community/Llama-2-13b-hf-v6-k4096-0-ft
Updated
Nov 18, 2024
VPTQ-community/Meta-Llama-3-8B-v12-k4096-4096-woft
Updated
Mar 20
•
2
VPTQ-community/Llama-2-70b-hf-v16-k65536-65536-woft
Updated
Nov 18, 2024
•
1
VPTQ-community/Meta-Llama-3-70B-v16-k65536-65536-woft
Updated
Mar 20
•
1
VPTQ-community/Llama-2-7b-hf-v6-k4096-4096
Updated
Nov 18, 2024
Previous
1
2
3
Next