This is an 8-bit GPTQ-quantized version of Alibaba-NLP/gte-Qwen2-1.5B-instruct, produced by following the example from the AutoGPTQ repository.
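For readers who want to reproduce a similar checkpoint, the following is a minimal sketch modeled on the basic-usage pattern in the AutoGPTQ README. Only the base model name and the 8-bit width come from this card. The `group_size` and `desc_act` values, the calibration sentences, and the output directory are assumptions chosen for illustration, and the card does not confirm whether the base checkpoint needs extra loading arguments such as `trust_remote_code=True`.

```python
# Sketch only: group_size, desc_act, the calibration texts, and OUTPUT_DIR are
# illustrative assumptions, not the settings used to build this checkpoint.
BASE_MODEL = "Alibaba-NLP/gte-Qwen2-1.5B-instruct"
OUTPUT_DIR = "gte-Qwen2-1.5B-instruct-Q8-GPTQ"  # hypothetical local directory

# bits=8 matches the card; the other two values are assumed defaults.
QUANT_SETTINGS = {"bits": 8, "group_size": 128, "desc_act": False}

# Placeholder calibration data; a real run would use many representative texts.
CALIBRATION_TEXTS = [
    "GPTQ is a post-training quantization method for transformer weights.",
    "Sentence embeddings map text to dense vectors for similarity search.",
]


def quantize_to_gptq(base_model=BASE_MODEL, output_dir=OUTPUT_DIR):
    """Quantize base_model with GPTQ and save the result to output_dir."""
    # Imported lazily so this file can be read or imported without auto-gptq.
    from transformers import AutoTokenizer
    from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

    tokenizer = AutoTokenizer.from_pretrained(base_model)
    # Each example carries input_ids and attention_mask from the tokenizer.
    examples = [tokenizer(text) for text in CALIBRATION_TEXTS]

    quantize_config = BaseQuantizeConfig(**QUANT_SETTINGS)
    model = AutoGPTQForCausalLM.from_pretrained(base_model, quantize_config)

    model.quantize(examples)           # runs GPTQ calibration layer by layer
    model.save_quantized(output_dir)   # writes the quantized weights
    tokenizer.save_pretrained(output_dir)
    return output_dir
```

Calling `quantize_to_gptq()` downloads the base model and generally expects a CUDA-capable GPU. Two calibration sentences are far too few for a good result, so treat them as stand-ins for a larger, domain-relevant sample.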
Model tree for ktoprakucar/gte-Qwen2-1.5B-instruct-Q8-GPTQ
Base model: Alibaba-NLP/gte-Qwen2-1.5B-instruct