nomic-embed-text-v1-ablated is a text encoder with an 8192-token context length. This checkpoint was trained on a modified version of the training dataset that differs from the dataset used to train our final model. We are releasing it to help understand the impact that subsets of our training data had on model outcomes. This release is part of our commitment to open-sourcing the training artifacts from our Nomic Embed Text tech report.
If you want to use a model to extract embeddings, we suggest using nomic-embed-text-v1.
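For readers who want to follow that suggestion, here is a minimal sketch of extracting embeddings with nomic-embed-text-v1 through the `sentence-transformers` library. It rests on several assumptions not stated above: the Hugging Face repo id `nomic-ai/nomic-embed-text-v1`, the need for `trust_remote_code=True`, and the `search_document: ` style task prefix. The helper names (`with_task_prefix`, `cosine_similarity`, `embed`) are hypothetical and exist only for illustration.

```python
import math

# Assumption: Hugging Face repo id of the recommended model.
MODEL_ID = "nomic-ai/nomic-embed-text-v1"


def with_task_prefix(texts, task="search_document"):
    """Prepend a task prefix (assumed format: '<task>: ') to each text."""
    return [f"{task}: {text}" for text in texts]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def embed(texts, task="search_document"):
    """Encode texts with the recommended model (downloads weights on first use)."""
    # Imported lazily so the helpers above work without the library installed.
    from sentence_transformers import SentenceTransformer

    # Assumption: the model ships custom code, hence trust_remote_code=True.
    model = SentenceTransformer(MODEL_ID, trust_remote_code=True)
    return model.encode(with_task_prefix(texts, task))


if __name__ == "__main__":
    docs = embed(["Nomic Embed is a long-context text embedder."])
    query = embed(["What is Nomic Embed?"], task="search_query")
    print(cosine_similarity(query[0], docs[0]))
```

Running the script requires `sentence-transformers` and network access to fetch the weights. The same sketch should work for the ablated checkpoint by changing `MODEL_ID`, although that repo id is not given here.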