No model found error

#80
by huggit0000 - opened

Hi, I'm trying to use the model offline. I downloaded both models, restructured models--jinaai--jina-embeddings-v3 and models--jinaai--xlm-roberta-flash-implementation, and added a config.json with model_type.

It works with transformers, but gives an error with sentence_transformers:

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("/home//.cache/huggingface/models--jinaai--jina-embeddings-v3", trust_remote_code=True)

import json

# Path to your config.json file
config_path = "/home//.cache/huggingface/models--jinaai--jina-embeddings-v3/config.json"

# Load the config file
with open(config_path, "r") as config_file:
    config = json.load(config_file)

# Check if the 'task_instructions' key exists
task_instructions = config.get('task_instructions', {})

# Print the loaded task instructions
print(task_instructions)



task = "retrieval.query"
embeddings = model.encode(
    ["What is the weather like in Berlin today?"],
    task=task,
    prompt_name=task,
)
No sentence-transformers model found with name /home//.cache/huggingface/models--jinaai--jina-embeddings-v3. Creating a new one with mean pooling.
{'retrieval.query': 'Represent the query for retrieving evidence documents: ', 'retrieval.passage': 'Represent the document for retrieval: ', 'separation': '', 'classification': '', 'text-matching': ''}
Traceback (most recent call last):
  File "/home//miniconda3/lib/python3.12/site-packages/sentence_transformers/SentenceTransformer.py", line 534, in encode
    prompt = self.prompts[prompt_name]
             ~~~~~~~~~~~~^^^^^^^^^^^^^
KeyError: 'retrieval.query'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home//mycode/jina_sentence_trnasformer.py", line 23, in <module>
    embeddings = model.encode(
                 ^^^^^^^^^^^^^
  File "/home//miniconda3/lib/python3.12/site-packages/sentence_transformers/SentenceTransformer.py", line 536, in encode
    raise ValueError(
ValueError: Prompt name 'retrieval.query' not found in the configured prompts dictionary with keys [].

Any guidance would be appreciated.

Hello!
Does your model directory (/home//.cache/huggingface/models--jinaai--jina-embeddings-v3) contain a config_sentence_transformers.json file? (https://huggingface.co/jinaai/jina-embeddings-v3/blob/main/config_sentence_transformers.json)

This file is used to load the prompts, e.g. the one with the key "retrieval.query" in your case. Since the file is missing, that prompt wasn't loaded, resulting in your crash.
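If moving the file isn't convenient, recent sentence-transformers releases also accept a `prompts` argument at load time. A minimal sketch, assuming the prompt strings below (copied from the task_instructions output you printed) match the "prompts" entry in config_sentence_transformers.json on the Hub; the constructor call is commented out since it needs the actual model files:

```python
# Prompts as printed from the config above; they mirror the "prompts"
# entry in config_sentence_transformers.json on the Hub.
prompts = {
    "retrieval.query": "Represent the query for retrieving evidence documents: ",
    "retrieval.passage": "Represent the document for retrieval: ",
    "separation": "",
    "classification": "",
    "text-matching": "",
}

# Hypothetical usage (the path is your local model directory); passing
# prompts= explicitly makes encode(prompt_name="retrieval.query") work even
# when config_sentence_transformers.json is absent:
# from sentence_transformers import SentenceTransformer
# model = SentenceTransformer(
#     "/home//.cache/huggingface/models--jinaai--jina-embeddings-v3",
#     trust_remote_code=True,
#     prompts=prompts,
# )
```

That said, copying the file into place is the cleaner fix, since it also carries the default prompt configuration.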

  • Tom Aarsen

@tomaarsen Thanks for your quick reply! No, I didn't move that one. While I'm at it, can you tell me which other files I should relocate so that I can use the full feature set?

.
├── config.json
├── jinaai
│   └── xlm-roberta-flash-implementation
│       ├── block.py
│       ├── configuration_xlm_roberta.py
│       ├── embedding.py
│       ├── mha.py
│       ├── mlp.py
│       ├── modeling_lora.py
│       ├── modeling_xlm_roberta.py
│       ├── rotary.py
│       ├── stochastic_depth.py
│       └── xlm_padding.py
├── model.safetensors
├── special_tokens_map.json
├── tokenizer.json
└── tokenizer_config.json

My recommendation is to:

  1. Clone this repository (https://huggingface.co/jinaai/jina-embeddings-v3/blob/main/config_sentence_transformers.json?clone=true) to get all files
  2. Copy all files from the implementation repository into the local directory, i.e. adjacent to model.safetensors etc.
  3. Update the config.json (from the jina-embeddings-v3 clone) and turn the "auto_map" values from "jinaai/xlm-roberta-flash-implementation--file.class" into just "file.class". Because the files are adjacent, transformers will be able to find them.

Then you can load the model by providing the path to the directory that you cloned into. That way you only have one directory with everything in it (plus a few necessary subdirectories).
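Step 3 can be scripted; a small sketch (the function name is mine, and the example auto_map entry is illustrative, following the "repo--file.class" convention described above):

```python
def localize_auto_map(config: dict) -> dict:
    """Strip the "jinaai/xlm-roberta-flash-implementation--" style repo prefix
    from each auto_map entry, so transformers resolves the classes from the
    .py files placed next to config.json instead of a remote repository."""
    auto_map = config.get("auto_map", {})
    config["auto_map"] = {key: value.split("--")[-1] for key, value in auto_map.items()}
    return config

# Illustrative entry shaped like the ones in the jina-embeddings-v3 config.json:
config = {
    "auto_map": {
        "AutoModel": "jinaai/xlm-roberta-flash-implementation--modeling_lora.XLMRobertaLoRA"
    }
}
localized = localize_auto_map(config)
# localized["auto_map"]["AutoModel"] is now "modeling_lora.XLMRobertaLoRA"
```

In practice you would json.load the real config.json, pass it through this function, and json.dump it back in place.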

This ensures that you have all files and that performance matches the remote model, but you're free to test with some arbitrary texts to confirm that the embeddings are the same.
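That comparison can be sketched in pure numpy; the two toy arrays below stand in for the outputs of model.encode(texts) from the local and remote copies:

```python
import numpy as np

def embeddings_match(local: np.ndarray, remote: np.ndarray, atol: float = 1e-5) -> bool:
    """True when the two embedding matrices agree elementwise within a small
    tolerance (exact equality is too strict for floating-point outputs)."""
    return local.shape == remote.shape and np.allclose(local, remote, atol=atol)

# Toy stand-ins for the local and remote encodings of the same texts:
local = np.array([[0.1, 0.2, 0.3]])
remote = local + 1e-7  # tiny floating-point noise
```

For a correct local copy, `embeddings_match(local, remote)` should come back True.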

  • Tom Aarsen

Hi Tom @tomaarsen, thank you so much. I rearranged the files as you guided and it works fine 🙏

If you don't mind, can you give me some follow-ups?

  1. I couldn't find the LoRA adapter files; they seem to be merged into the safetensors. My plan is to fine-tune on top of the LoRA adapter. Do you have any idea how?
  2. I'm a bit concerned that my fine-tuning could overfit, so can you suggest a minimum dataset size that doesn't hurt performance?
  3. Is Matryoshka embedding only enabled when I set truncate_dim=, or is it just on by default?

Maybe the model authors can just patch it to make it usable without all this shamanism?
