Using this model in ollama - lacks vision capability?

#2 · opened by ksw74

"ollama run hf.co/mlabonne/gemma-3-27b-it-qat-abliterated-GGUF:Q5_K_M" works well for text input, but adding an image to the command line doesn't work and per "/show info" there is no vision capability.

Is this intentional, or a known limitation?
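
For reference, with a vision-capable model Ollama picks an image up when its file path is included in the prompt, roughly like this (the image path is just a placeholder):

    ollama run hf.co/mlabonne/gemma-3-27b-it-qat-abliterated-GGUF:Q5_K_M "Describe this image: ./photo.jpg"

Here, the path just gets treated as ordinary text.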

It's due to the non-standard structure of the model. I'll try to fix it in the next version, but in the meantime it's possible to add the mmproj file.
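
For anyone who wants to try the mmproj route before the next version lands, a minimal sketch using llama.cpp's multimodal CLI (the mmproj filename and image path are placeholders; you'd need a projector GGUF that matches Gemma 3's vision tower, and older llama.cpp builds name the binary differently):

    llama-mtmd-cli -m gemma-3-27b-it-qat-abliterated.Q5_K_M.gguf \
        --mmproj mmproj-gemma-3-27b-it-f16.gguf \
        --image ./photo.jpg \
        -p "Describe this image."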

Will the model work for vision if I load it in Python with transformers rather than using Ollama?

Great, thanks, I'll give it a go.

No luck loading the model via transformers, unfortunately. If I try Gemma3ForConditionalGeneration.from_pretrained, I get a warning "You are using a model of type gemma3_text to instantiate a model of type gemma3. This is not supported for all configurations of models and can yield errors." followed by a "size mismatch for weight" error.

If I instead try the generic pipeline(task="image-text-to-text", ...), I get "ValueError: Unrecognized configuration class <class 'transformers.models.gemma.configuration_gemma.GemmaConfig'> for this kind of AutoModel: AutoModelForImageTextToText. Model type should be one of AriaConfig, AyaVisionConfig, BlipConfig, Blip2Config, ChameleonConfig, Emu3Config, FuyuConfig, Gemma3Config, GitConfig, GotOcr2Config, IdeficsConfig, Idefics2Config, Idefics3Config, InstructBlipConfig, InternVLConfig, JanusConfig, Kosmos2Config, Llama4Config, LlavaConfig, LlavaNextConfig, LlavaNextVideoConfig, LlavaOnevisionConfig, Mistral3Config, MllamaConfig, PaliGemmaConfig, Pix2StructConfig, PixtralVisionConfig, Qwen2_5_VLConfig, Qwen2VLConfig, ShieldGemma2Config, SmolVLMConfig, UdopConfig, VipLlavaConfig, VisionEncoderDecoderConfig."

Actually, I can load the model with AutoModel.from_pretrained, but then calling model.generate fails with "AttributeError: 'Gemma3TextModel' object has no attribute 'generate'".
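
For comparison, a minimal sketch of the multimodal loading path that works for the original google/gemma-3-27b-it checkpoint in recent transformers (4.50+); the image URL and generation settings are placeholders, and since this repo's config reports gemma3_text, the same code presumably hits the size-mismatch error above here:

    # Minimal multimodal load sketch for the original Gemma 3 checkpoint
    # (assumes transformers >= 4.50; image URL is a placeholder).
    import torch
    from transformers import AutoProcessor, Gemma3ForConditionalGeneration

    model_id = "google/gemma-3-27b-it"  # the original vision-capable weights
    model = Gemma3ForConditionalGeneration.from_pretrained(
        model_id, torch_dtype=torch.bfloat16, device_map="auto"
    )
    processor = AutoProcessor.from_pretrained(model_id)

    messages = [{"role": "user", "content": [
        {"type": "image", "url": "https://example.com/photo.jpg"},
        {"type": "text", "text": "Describe this image."},
    ]}]
    inputs = processor.apply_chat_template(
        messages, add_generation_prompt=True, tokenize=True,
        return_dict=True, return_tensors="pt",
    ).to(model.device)

    out = model.generate(**inputs, max_new_tokens=100)
    print(processor.decode(out[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))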
