nomic-embed-text-v1-ablated: A Reproducible Long Context (8192) Text Embedder

nomic-embed-text-v1-ablated is a text encoder with an 8192-token context length. This checkpoint was trained on a modified version of the dataset used to train our final model. We are releasing it to help understand the impact that subsets of our training data had on model outcomes. This release is part of our commitment to open-source the training artifacts from our Nomic Embed Text tech report.

If you want to use a model to extract embeddings, we suggest using nomic-embed-text-v1.

