EmbeddingGemma-300m β€” 64K-vocab-trimmed, PTQ-q4 (transformers.js)

Vocabulary-trimmed (262K β†’ 64K EN+code ByteLevel-BPE) EmbeddingGemma-300m, PTQ int4, exported as a single self-contained onnx/model_q4.onnx (external data inlined β€” required for iOS WKWebView). Built for the Obsidian "Seek" plugin.

NOTE: the repo slug still says qat-q8 for URL stability β€” this is a misnomer; the actual model is trimmed PTQ-q4 (no QAT, not q8).

Downloads last month
8
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for tooape/embeddinggemma-300m-qat-q8-ONNX

Quantized
(293)
this model