
ggml_llava-v1.5-7b

This repo contains GGUF files for running llava-v1.5-7b inference end-to-end with llama.cpp, without any extra dependencies.

Note: The mmproj-model-f16.gguf file structure is experimental and may change. Always use the latest code in llama.cpp.
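Because the file structure may change, it can help to check which GGUF version a downloaded file declares before debugging a load failure. The snippet below is a minimal sketch, not part of llama.cpp: the function name `read_gguf_header` is hypothetical, and it assumes a little-endian file with GGUF version 2 or later (where the two header counts are 64-bit). It reads only the fixed header fields and does not parse metadata or tensors.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF header.

    Hypothetical helper for illustration. Assumes little-endian layout and
    GGUF version >= 2, where both counts are unsigned 64-bit integers.
    """
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != GGUF_MAGIC:
            raise ValueError(f"not a GGUF file (magic={magic!r})")
        # 32-bit format version follows the magic bytes
        (version,) = struct.unpack("<I", f.read(4))
        if version < 2:
            raise ValueError(f"unsupported GGUF version {version}")
        # tensor count, then number of metadata key/value pairs
        tensor_count, kv_count = struct.unpack("<QQ", f.read(16))
    return version, tensor_count, kv_count
```

For example, `read_gguf_header("mmproj-model-f16.gguf")` returns the declared version and counts for the projector file, assuming it is in the current directory.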

GGUF
Model size: 312M params
Architecture: clip
Quantizations: 4-bit, 5-bit, 16-bit

