ONNX version of intfloat/e5-large-v2
This is a sentence-transformers model: it maps sentences and paragraphs to a 1024-dimensional dense vector space and can be used for tasks such as clustering or semantic search.
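As a rough illustration of the semantic search use case, the sketch below ranks passages by cosine similarity to a query. The function name `rank_by_cosine` and the toy 3-dimensional vectors are assumptions made for this example; in practice the vectors would come from the model.

```python
import numpy as np

def rank_by_cosine(query_vec, passage_vecs):
    """Return passage indices sorted from most to least similar, plus the scores."""
    q = query_vec / np.linalg.norm(query_vec)
    p = passage_vecs / np.linalg.norm(passage_vecs, axis=1, keepdims=True)
    scores = p @ q                 # cosine similarity of each passage to the query
    order = np.argsort(-scores)    # descending by similarity
    return order, scores

# Toy vectors stand in for real model embeddings (hypothetical data).
query = np.array([1.0, 0.0, 0.0])
passages = np.array([
    [0.0, 1.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.5, 0.5, 0.0],
])
order, scores = rank_by_cosine(query, passages)
```

The same normalized vectors can also be passed to a clustering routine, since cosine similarity on unit vectors is equivalent to a dot product.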
The model was converted with the onnx-convert tool, using the following parameters:
python convert.sh --model_id intfloat/e5-large-v2 --quantize QInt8 --optimize 2
Two versions of the model are available:
- model.onnx: Float32 version, with optimize=2
- model_opt2_QInt8.onnx: QInt8 quantized version, with optimize=2
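A minimal sketch of how either file might be loaded with ONNX Runtime to produce embeddings. It assumes that the `onnxruntime` and `transformers` packages are installed, that the model file has been downloaded locally, that the tokenizer of the original intfloat/e5-large-v2 model is used, and that the first output of the exported graph is the per-token hidden state of shape (batch, tokens, hidden). The helper names `mean_pool` and `embed` are hypothetical and not part of this repository.

```python
import numpy as np

def mean_pool(last_hidden_state, attention_mask):
    """Average token vectors per sentence, ignoring padding positions."""
    mask = attention_mask[..., None].astype(last_hidden_state.dtype)
    summed = (last_hidden_state * mask).sum(axis=1)
    counts = np.clip(mask.sum(axis=1), 1e-9, None)  # avoid division by zero
    return summed / counts

def embed(texts, model_path="model_opt2_QInt8.onnx",
          tokenizer_id="intfloat/e5-large-v2"):
    """Return L2-normalized sentence embeddings (assumed graph layout, see above)."""
    # Imported lazily so mean_pool can be used without these packages.
    import onnxruntime as ort
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    enc = tokenizer(texts, padding=True, truncation=True,
                    max_length=512, return_tensors="np")
    # Feed only the inputs that the exported graph actually declares.
    wanted = {inp.name for inp in session.get_inputs()}
    feeds = {k: v.astype(np.int64) for k, v in enc.items() if k in wanted}
    # Assumption: the first output is the per-token hidden state.
    last_hidden_state = session.run(None, feeds)[0]
    pooled = mean_pool(last_hidden_state, enc["attention_mask"])
    return pooled / np.linalg.norm(pooled, axis=1, keepdims=True)
```

Calling `embed(["query: how do I convert a model to ONNX?"])` would then return one vector per input text. The "query: " prefix follows the convention of the original E5 models and is likewise an assumption here; passing `model_path="model.onnx"` selects the Float32 file instead.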
License
Apache 2.0