Qwen/Qwen2.5-VL-7B-Instruct (Quantized)

Description

This model is a 4-bit quantized version of Qwen/Qwen2.5-VL-7B-Instruct.

It was quantized with the BitsAndBytes library, using the bnb-my-repo space.
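As an illustration of what 4-bit BitsAndBytes loading can look like, here is a minimal sketch using the transformers library against the base model. It is not the exact recipe used to produce this repository: the card only says "4-bit", so the NF4 quant type and bfloat16 compute dtype are assumptions, and the names `QUANT_SETTINGS` and `load_4bit` are hypothetical helpers introduced for this example. It also assumes torch, transformers (a version that includes Qwen2.5-VL), accelerate, and bitsandbytes are installed.

```python
# Plain settings kept separate so they can be inspected without heavy imports.
# Assumptions: the model card only says "4-bit"; the NF4 quant type here and
# the bfloat16 compute dtype below are common choices, not confirmed values.
BASE_MODEL_ID = "Qwen/Qwen2.5-VL-7B-Instruct"

QUANT_SETTINGS = {
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "nf4",
}


def load_4bit(model_id: str = BASE_MODEL_ID):
    """Load the model, applying 4-bit BitsAndBytes quantization at load time."""
    # Imported lazily: these need torch, transformers, accelerate, and
    # bitsandbytes, and typically a CUDA-capable GPU.
    import torch
    from transformers import (
        AutoProcessor,
        BitsAndBytesConfig,
        Qwen2_5_VLForConditionalGeneration,
    )

    quant_config = BitsAndBytesConfig(
        bnb_4bit_compute_dtype=torch.bfloat16,
        **QUANT_SETTINGS,
    )
    model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
        model_id,
        quantization_config=quant_config,
        device_map="auto",  # let accelerate place layers on available devices
    )
    processor = AutoProcessor.from_pretrained(model_id)
    return model, processor
```

Calling `model, processor = load_4bit()` downloads the base weights and quantizes them while loading. For a repository that already stores quantized weights, `from_pretrained` normally picks up the saved quantization configuration, so passing `quantization_config` should not be needed.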

Safetensors

Model size: 3.91B params

Tensor type: F32, BF16
