Please provide the GGUF and mmproj files, thanks
#1 by ericjing - opened
Ask @mradermacher. Request here: https://huggingface.co/mradermacher/model_requests
(I think they've quantized LLaVA models before, e.g. mradermacher/llama-joycaption-beta-one-hf-llava-GGUF, but config.json shows this is a custom "Keye" architecture.)
It's not supported by llama.cpp, unfortunately.
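One quick way to see why llama.cpp can't convert it is to look at the `architectures` field in `config.json`. A minimal sketch, assuming a Keye-style architecture name and an illustrative (incomplete) list of architectures llama.cpp's converter recognizes:

```python
import json

# Hypothetical excerpt of the model's config.json; the real file will differ.
config_text = '{"architectures": ["KeyeForConditionalGeneration"], "model_type": "keye"}'

# Illustrative subset of architectures llama.cpp's convert script knows about.
KNOWN = {"LlamaForCausalLM", "LlavaForConditionalGeneration", "MistralForCausalLM"}

config = json.loads(config_text)
arch = config["architectures"][0]
print(arch, "is supported" if arch in KNOWN else "is not supported by llama.cpp")
```

If the architecture string isn't in the converter's registry, `convert_hf_to_gguf.py` will bail out before producing a GGUF.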
Looks like the team is integrating support with vLLM per this PR:
https://github.com/vllm-project/vllm/pull/20126
You might have success with a bitsandbytes 4-bit quant or AWQ after that!