Need mmproj file

#2
by notmebug - opened

Hi, thank you for the model.

This needs to support image inputs via LMstudio user interface. For this the problem description I give you from LLM:

LM Studio's Multimodal Requirements:

LM Studio does support multimodal models (text + image), primarily those based on the LLaVA (Large Language and Vision Assistant) architecture or similar architectures compatible with llama.cpp's multimodal features.

For this to work, you typically need two files:

The main LLM GGUF file (e.g., model-q4_k_m.gguf) which has been specifically compiled with multimodal capabilities.

A separate CLIP Vision Encoder/Projector file (often named something like mmproj-<model_type>-f16.gguf or similar). This file processes the image and translates it into embeddings the LLM can understand.

LM Studio needs both files. It usually expects the mmproj file to be in the same directory as the main GGUF model file. When it detects both compatible files, it displays the little "eye" icon next to the model name, indicating vision capabilities are enabled.

that file adds support for images via LMstudio and the model then looks like this when searching for it:

image.png

@ngxson Sorry to bother you, but as llama.cpp's multimodal expert, you might be able to help me here. Do you know how I can create this missing file?

@mlabonne if the abliteration only affect the language model, you can simply copy-paste the mmproj file from bartowski's gguf (non-abliterated models), could you give it a try?

Thanks! I stole bartowski's file and uploaded it here. Let me know if that works for you @notmebug

It works excellently in LM-Studio.
Thank you very much.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment