Instructions to use cnmoro/nomic-embed-text-v2-moe-distilled-high-quality with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use cnmoro/nomic-embed-text-v2-moe-distilled-high-quality with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("cnmoro/nomic-embed-text-v2-moe-distilled-high-quality") sentences = [ "The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [3, 3] - Model2Vec
How to use cnmoro/nomic-embed-text-v2-moe-distilled-high-quality with Model2Vec:
from model2vec import StaticModel model = StaticModel.from_pretrained("cnmoro/nomic-embed-text-v2-moe-distilled-high-quality") - Notebooks
- Google Colab
- Kaggle
Update 2026/04/15:
This new model is a complete replacement (smaller, faster and better).
This Model2Vec model was created by using Tokenlearn, with nomic-embed-text-v2-moe as a base.
The output dimension is 768.
The evaluation in the model card, was executed using this model (distilled), not the original.
The process to create this one, was not a simple model2vec distill, this involved generating embeddings for 23M triplets (msmarco) with the original model, then training the tokenlearn model on it, with the nomic model as a base.
Usage
Load this model using model2vec library:
from model2vec import StaticModel
model = StaticModel.from_pretrained("cnmoro/nomic-embed-text-v2-moe-distilled-high-quality")
# Compute text embeddings
embeddings = model.encode(["Example sentence"])
Or using sentence-transformers library:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer('cnmoro/nomic-embed-text-v2-moe-distilled-high-quality')
# Compute text embeddings
embeddings = model.encode(["Example sentence"])
- Downloads last month
- 38,641
Model tree for cnmoro/nomic-embed-text-v2-moe-distilled-high-quality
Base model
FacebookAI/xlm-roberta-baseDataset used to train cnmoro/nomic-embed-text-v2-moe-distilled-high-quality
Space using cnmoro/nomic-embed-text-v2-moe-distilled-high-quality 1
Collection including cnmoro/nomic-embed-text-v2-moe-distilled-high-quality
Evaluation results
- pearson on MTEB Assin2STS (default)test set self-reported64.533
- spearman on MTEB Assin2STS (default)test set self-reported58.746
- cosine_pearson on MTEB Assin2STS (default)test set self-reported64.533
- cosine_spearman on MTEB Assin2STS (default)test set self-reported58.746
- manhattan_pearson on MTEB Assin2STS (default)test set self-reported62.204
- manhattan_spearman on MTEB Assin2STS (default)test set self-reported58.837
- euclidean_pearson on MTEB Assin2STS (default)test set self-reported62.072
- euclidean_spearman on MTEB Assin2STS (default)test set self-reported58.746