Can't load in TextGen w/llamacpp_HF
#3 · opened by biship
Error: Could not load the model because a tokenizer in Transformers format was not found
However, I can load it with llama.cpp.
Do you have a tokenizer_config.json?
Only a LoRA adapter for Mistral-Instruct-7B-v0.2 and a GGUF quantization for llama.cpp are provided here, not the full FP16 weights, so a tokenizer_config.json is not needed. The tokenizer is the same as Mistral-7B's.
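If you still want to use the llamacpp_HF loader, which expects Transformers tokenizer files next to the GGUF, a minimal sketch of one workaround is below. It assumes you fetch the tokenizer files from the base mistralai/Mistral-7B-Instruct-v0.2 repo (which may require accepting its license on Hugging Face) into your local model folder; the folder path is a placeholder.

```python
# Sketch: copy the base model's tokenizer files next to the GGUF so the
# llamacpp_HF loader can find a Transformers-format tokenizer.
from huggingface_hub import hf_hub_download

# Placeholder: adjust to wherever your GGUF file lives.
MODEL_DIR = "text-generation-webui/models/my-gguf-model"

for fname in ("tokenizer.model", "tokenizer_config.json", "special_tokens_map.json"):
    hf_hub_download(
        repo_id="mistralai/Mistral-7B-Instruct-v0.2",
        filename=fname,
        local_dir=MODEL_DIR,
    )
```

After that, reloading the model with llamacpp_HF should pick up the tokenizer from the same folder.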
Ah, OK. Is there a difference between running it these two ways?
There shouldn't be; the provided GGUF quantization should be almost lossless.
lemonilia changed discussion status to closed