tooape's picture
Replace with trimmed-64K-vocab PTQ-q4 (transformers.js, inlined single-file)
4837cec verified
|
Raw
History Blame Contribute Delete
638 Bytes
metadata
library_name: transformers.js
pipeline_tag: feature-extraction
base_model: google/embeddinggemma-300m
license: gemma
tags:
  - onnx
  - embeddinggemma
  - sentence-similarity
  - quantized

EmbeddingGemma-300m — 64K-vocab-trimmed, PTQ-q4 (transformers.js)

Vocabulary-trimmed (262K → 64K EN+code ByteLevel-BPE) EmbeddingGemma-300m, PTQ int4, exported as a single self-contained onnx/model_q4.onnx (external data inlined — required for iOS WKWebView). Built for the Obsidian "Seek" plugin.

NOTE: the repo slug still says qat-q8 for URL stability — this is a misnomer; the actual model is trimmed PTQ-q4 (no QAT, not q8).