These are quantized version of the BakLLaVA model to be used with llama.cpp and Python bindings (llama-cpp-python and PyLLMCore).

GGUF

Model size

312M params

Architecture

clip

Hardware compatibility

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support