About
Quantized models collected from various sources for ease of use.
Sources
Some models were quantized by me. The rest come from the following sources:
- GGUF: TheBloke, ollama.ai/library, Undi95, LoneStriker, s3nh, athirdpath, Sao10K, maddes8cht, Kquant03, ikawrakow, nakodanei, NousResearch, second-state, wolfram, Weyaxi, bartowski, FPHam, teknium, nold, NeverSleep, MaziyarPanahi, migtissera, openerotica, DavidAU, QuantFactory, RobertSinclair, mradermacher, BeaverAI, TheDrummer, grimjim, city96, HuggingFaceTB, liminerity, prithivMLmods
- EXL2: LoneStriker
- HQQ+ (HQQ + LoRA adapter): Mobius Labs GmbH
- SpinQuant: Meta Llama
- AWQ: TODO
- GPTQ: TODO
Disclaimer
These models are provided "as-is" without any warranty. The respective licenses apply to each model, and it is the user's responsibility to comply with the terms of these licenses.