pplx-embed-context-v1.1-4B (private preview)

A continued fine-tune of perplexity-ai/pplx-embed-context-v1-4B. Same architecture and loading code — drop-in replacement.

Targets three failure modes observed on adversarial paraphrase-resolution probes: paraphrase-resolution, lexical-distractor pull, and glossary co-retrieval.

Usage

Using Transformers
from transformers import AutoModel

model_ctx = AutoModel.from_pretrained(
    "seslami-pplx/pplx-embed-context-v1.1-4B",
    trust_remote_code=True
)

doc_chunks = [
    [
        "Curiosity begins in childhood with endless questions about the world.",
        "As we grow, curiosity drives us to explore new ideas.",
        "Scientific breakthroughs often start with a curious question."
    ],
    [
        "The curiosity rover explores Mars searching for ancient life.",
        "Each discovery on Mars sparks new questions about the universe."
    ]
]
# Returns list of numpy arrays (one per document)
# embeddings[0].shape = (3, 1024), embeddings[1].shape = (2, 1024)
embeddings = model_ctx.encode(doc_chunks)
Downloads last month
14
Safetensors
Model size
4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support