iproskurina
's Collections
Quantized LLMs
updated
iproskurina/Mistral-7B-v0.3-GPTQ-4bit-g128
Text Generation
•
Updated
•
39
iproskurina/bloom-7b1-GPTQ-4bit-g128
Text Generation
•
Updated
•
10
•
2
iproskurina/bloom-1b7-GPTQ-4bit-g128
Text Generation
•
Updated
•
92
iproskurina/bloom-3b-GPTQ-4bit-g128
Text Generation
•
Updated
•
63
iproskurina/bloom-560m-GPTQ-4bit-g128
Text Generation
•
Updated
•
92
iproskurina/bloom-1b1-GPTQ-4bit-g128
Text Generation
•
Updated
•
95
iproskurina/opt-2.7b-GPTQ-4bit-g128
Text Generation
•
Updated
•
122
iproskurina/opt-13b-GPTQ-4bit-g128
Text Generation
•
Updated
•
6
iproskurina/opt-6.7b-GPTQ-4bit-g128
Text Generation
•
Updated
•
108
iproskurina/opt-125m-GPTQ-4bit-g128
Text Generation
•
Updated
•
63
iproskurina/opt-350m-GPTQ-4bit-g128
Text Generation
•
Updated
•
95
iproskurina/opt-1.3b-GPTQ-4bit-g128
Text Generation
•
Updated
•
111
iproskurina/Mistral-7B-v0.1-GPTQ-8bit-g128
Text Generation
•
Updated
•
3
iproskurina/Mistral-7B-v0.3-GPTQ-8bit-g128
Text Generation
•
Updated
•
5
iproskurina/Mistral-7B-v0.1-GPTQ-3bit-g64
Text Generation
•
Updated
•
9
iproskurina/Mistral-7B-v0.1-GPTQ-8bit-g64
Text Generation
•
Updated
•
2
iproskurina/Mistral-7B-v0.1-GPTQ-4bit-g128
Text Generation
•
Updated
•
5
iproskurina/Mistral-7B-v0.1-GPTQ-3bit-g128
Text Generation
•
Updated
•
4
TheBloke/Mistral-7B-Instruct-v0.1-GPTQ
Text Generation
•
Updated
•
4.38k
•
79
TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
Text Generation
•
Updated
•
373k
•
50
TheBloke/bloomz-176B-GPTQ
Text Generation
•
Updated
•
14
•
20
TheBloke/BLOOMChat-176B-v1-GPTQ
Text Generation
•
Updated
•
13
•
31
TheBloke/Llama-2-13B-chat-GPTQ
Text Generation
•
Updated
•
36.7k
•
362
When Quantization Affects Confidence of Large Language Models?
Paper
•
2405.00632
•
Published