Need mmproj file
#2, opened by notmebug
Hi, thank you for the model.
The model needs to support image inputs through the LM Studio user interface. Here is a description of the problem, generated by an LLM:
LM Studio's Multimodal Requirements:
LM Studio does support multimodal models (text + image), primarily those based on the LLaVA (Large Language and Vision Assistant) architecture or similar architectures compatible with llama.cpp's multimodal features.
For this to work, you typically need two files:
The main LLM GGUF file (e.g., model-q4_k_m.gguf), which has been converted from a model with multimodal capabilities.
A separate CLIP Vision Encoder/Projector file (often named something like mmproj-<model_type>-f16.gguf or similar). This file processes the image and translates it into embeddings the LLM can understand.
LM Studio needs both files. It usually expects the mmproj file to be in the same directory as the main GGUF model file. When it detects both compatible files, it displays the little "eye" icon next to the model name, indicating vision capabilities are enabled.
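As a rough sketch of the two-file requirement described above, the following Python snippet checks whether a model folder contains both a main GGUF file and an mmproj file. The function name `find_vision_pair` and the rule of matching "mmproj" in the filename are assumptions made for illustration; this is not how LM Studio itself detects vision support.

```python
from pathlib import Path


def find_vision_pair(model_dir):
    """Return (main_model, mmproj) as Paths; either is None if not found.

    Assumption: the projector file has "mmproj" somewhere in its name,
    following the common mmproj-<model_type>-f16.gguf naming pattern.
    """
    ggufs = sorted(Path(model_dir).glob("*.gguf"))
    # Split the .gguf files into projector candidates and everything else.
    projectors = [p for p in ggufs if "mmproj" in p.name.lower()]
    mains = [p for p in ggufs if "mmproj" not in p.name.lower()]
    main = mains[0] if mains else None
    mmproj = projectors[0] if projectors else None
    return main, mmproj
```

Calling `find_vision_pair` on the folder that holds the downloaded model returns `None` in the second position when the mmproj file is missing, which is the situation reported here.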
It works excellently in LM Studio.
Thank you very much.