How to use?

by ishiima - opened Mar 2

Mar 2

Hi there! How exactly do you use embeds? I've been using your version of VNTL's model, and so far yours has usually been better at translations. I'm using LM Studio, but I don't know how to use these kinds of embeds. Do I just load it up alongside the model or something?

Casual-Autopsy

Owner Mar 2

Embeds are used for RAG (retrieval-augmented generation). I find that it helps with translations when you feed it chat pairs with the instruct format included.
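To make the idea concrete, here is a minimal sketch of that retrieval step, not the actual setup used with this model. Everything in it is an assumption for illustration: `embed` is a toy character-bigram stand-in for a real embedding model, `PAIRS` is made-up example data, and the `[SOURCE]` / `[TRANSLATION]` tags are placeholders for whatever instruct format your translation model expects.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: counts of character bigrams.

    A real setup would call the embedding model here instead.
    """
    return Counter(text[i:i + 2] for i in range(len(text) - 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a if key in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical stored chat pairs: (source line, earlier translation).
PAIRS = [
    ("おはようございます", "Good morning."),
    ("ありがとうございます", "Thank you very much."),
    ("今日は雨です", "It is raining today."),
]


def retrieve(query, pairs, k=2):
    """Return the k stored pairs whose source line is most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(pairs, key=lambda p: cosine(query_vec, embed(p[0])), reverse=True)
    return ranked[:k]


def build_prompt(query, pairs, k=2):
    """Prepend the retrieved pairs, in a placeholder instruct format, to the new line."""
    blocks = [f"[SOURCE]\n{src}\n[TRANSLATION]\n{tgt}" for src, tgt in retrieve(query, pairs, k)]
    blocks.append(f"[SOURCE]\n{query}\n[TRANSLATION]\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    print(build_prompt("ありがとう", PAIRS, k=1))
```

The resulting prompt shows the translation model a similar, already-translated line before the new one, which is the part that seems to help. Swapping `embed` for a real embedding model changes the quality of the matches but not the overall flow.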

However, I've only had success running it with llama.cpp once it's converted to GGUF format. The problem is that I have no idea how to build a new sentencepiece.bpe.model from the tokenizer, which I'd need in order to convert it to GGUF. That means you have to run it via transformers, which I have no experience with.

ishiima

Mar 2

I see, thanks! I've mostly just been running the model with LunaTranslator, so unfortunately I'm a complete beginner with this.

ishiima changed discussion status to closed Mar 2
