Unable to load model directly on the machine

#2
by yjyash0701 - opened

getting this error, using the statements provided to load the model directly
ValueError: The checkpoint you are trying to load has model type llava_mistral but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

You can update Transformers with the command pip install --upgrade transformers. If this does not work, and the checkpoint is very new, then there may not be a release version that supports this model yet. In this case, you can get the most up-to-date code by installing Transformers from source with the command pip install git+https://github.com/huggingface/transformers.git

yjyash0701 changed discussion status to closed
yjyash0701 changed discussion status to open

Instead of using Transformers to load it, you should use LlavaMistralForCausalLM from the LLaVA-Med repo. You’ll need to clone the LLaVA-Med repo to access those functions.

Sign up or log in to comment