sentence-transformers/natural-questions
Viewer • Updated • 100k • 4.09k • 45
How to use cnmoro/inference-free-splade-co-condenser-en-ptbr with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("cnmoro/inference-free-splade-co-condenser-en-ptbr")
sentences = [
"That is a happy person",
"That is a happy dog",
"That is a very happy person",
"Today is a sunny day"
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]This model is a sparse retriever trained with Sentence Transformers' SPLADE stack, using a merged multilingual corpus (English + Portuguese).
from sentence_transformers import SparseEncoder
sparse_model = SparseEncoder("cnmoro/inference-free-splade-co-condenser-en-ptbr")
sparse_embeddings = sparse_model.encode(["Hello", "World"], show_progress_bar=True)
Luyu/co-condenser-marcoThe training corpus was built by row-wise concatenation of:
sentence-transformers/natural-questionscnmoro/GPT4-500k-Augmented-PTBR-Cleancnmoro/WizardVicuna-PTBR-Instruct-CleanFinal merged size:
12SpladeLoss(SparseMultipleNegativesRankingLoss)0.0302e-50.1fp16=TrueNO_DUPLICATESquery -> query, answer -> documenttrain_runtime: 5756.8386 strain_steps_per_second: 4.714train_samples_per_second: 150.841train_loss: 0.30475Base model
Luyu/co-condenser-marco
from sentence_transformers import SentenceTransformer model = SentenceTransformer("cnmoro/inference-free-splade-co-condenser-en-ptbr") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4]