pplx-embed-context-v1.1-4B (private preview)

A continued fine-tune of perplexity-ai/pplx-embed-context-v1-4B. Same architecture and loading code — drop-in replacement.

Targets three failure modes observed on adversarial paraphrase-resolution probes: paraphrase-resolution, lexical-distractor pull, and glossary co-retrieval.

Usage

Using Transformers

from transformers import AutoModel

model_ctx = AutoModel.from_pretrained(
    "seslami-pplx/pplx-embed-context-v1.1-4B",
    trust_remote_code=True
)

doc_chunks = [
    [
        "Curiosity begins in childhood with endless questions about the world.",
        "As we grow, curiosity drives us to explore new ideas.",
        "Scientific breakthroughs often start with a curious question."
    ],
    [
        "The curiosity rover explores Mars searching for ancient life.",
        "Each discovery on Mars sparks new questions about the universe."
    ]
]
# Returns list of numpy arrays (one per document)
# embeddings[0].shape = (3, 1024), embeddings[1].shape = (2, 1024)
embeddings = model_ctx.encode(doc_chunks)

Downloads last month: 14

Safetensors

Model size

4B params

Tensor type

F32

Inference Providers NEW

Text Retrieval

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support