nomic-embed-text-v1-ablated: A Reproducible Long Context (8192) Text Embedder

nomic-embed-text-v1-ablated is a text encoder with an 8192-token context length. This checkpoint was trained on a modified version of the dataset used to train our final model. We are releasing it to help understand the impact that subsets of our training data had on model outcomes. This release is part of our commitment to open-source the training artifacts from our Nomic Embed Text tech report.

If you want to use a model to extract embeddings, we suggest using nomic-embed-text-v1.
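
As a rough illustration of that suggestion, here is a minimal sketch of extracting embeddings with nomic-embed-text-v1. It assumes the model is loaded through the sentence-transformers library with `trust_remote_code=True`, and that inputs carry a task prefix such as `search_document` or `search_query`, as described on the nomic-embed-text-v1 model card. The helper names `add_task_prefix` and `embed` are hypothetical and exist only for this example.

```python
from typing import List

# Assumption: these are the task prefixes listed on the nomic-embed-text-v1 model card.
TASK_PREFIXES = ("search_document", "search_query", "clustering", "classification")


def add_task_prefix(texts: List[str], task: str = "search_document") -> List[str]:
    """Prepend a task prefix such as 'search_document: ' to every text."""
    if task not in TASK_PREFIXES:
        raise ValueError(f"unknown task prefix: {task!r}")
    return [f"{task}: {text}" for text in texts]


def embed(texts: List[str], task: str = "search_document",
          model_name: str = "nomic-ai/nomic-embed-text-v1"):
    """Return one embedding per text (downloads the model on first call)."""
    # Imported lazily so add_task_prefix works without the dependency installed.
    from sentence_transformers import SentenceTransformer

    # Assumption: the model ships custom modeling code, hence trust_remote_code=True.
    model = SentenceTransformer(model_name, trust_remote_code=True)
    return model.encode(add_task_prefix(texts, task))
```

Calling `embed(["What is an ablation study?"], task="search_query")` should return one vector per input text. Passing `model_name="nomic-ai/nomic-embed-text-v1-ablated"` would load this checkpoint for comparison, assuming it exposes the same loading interface.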

Join the Nomic Community
