---
license: apache-2.0
base_model:
- Qwen/Qwen3-VL-Embedding-2B
pipeline_tag: feature-extraction
tags:
- embeddings
- pytorch
---
# About
This repository contains precomputed **feature-extraction** matrices (embeddings) designed for **Sigma-Captioner**. These caches enable fast image-to-text alignment and similarity search without re-encoding the entire vocabulary on every run.
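As a rough illustration of what searching against a cached vocabulary matrix looks like, here is a minimal sketch. It is not Sigma-Captioner's actual code: the function name `top_k_matches`, the assumed `(vocab_size, dim)` layout of the matrix, and the random stand-in tensors are all assumptions made for the example, so check the real cache layout before relying on it.

```python
import torch
import torch.nn.functional as F


def top_k_matches(query: torch.Tensor, vocab_matrix: torch.Tensor, k: int = 5):
    """Return (scores, indices) of the k vocabulary rows most similar to query.

    Assumes query has shape (dim,) and vocab_matrix has shape (vocab_size, dim).
    Both the name and the layout are hypothetical.
    """
    # Normalize both sides so the dot product equals cosine similarity.
    q = F.normalize(query.float(), dim=-1)
    v = F.normalize(vocab_matrix.float(), dim=-1)
    scores = v @ q  # shape: (vocab_size,)
    k = min(k, scores.numel())
    return torch.topk(scores, k)


# Stand-in data: random "vocabulary" embeddings and a query copied from row 42.
torch.manual_seed(0)
vocab = torch.randn(1000, 64)
query = vocab[42].clone()
scores, indices = top_k_matches(query, vocab, k=3)
```

Because the vocabulary side is already encoded, each lookup costs one matrix-vector product; only the query (for example, an image embedding) has to be computed at run time.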
# Compatible with any project, but recommended for use with [Sigma-Captioner (SGLANG branch)](https://github.com/uninterruptedpowersupply3-NEW/Sigma-Captioner/tree/SGLANG)
### 📊 Dataset Sources
* **English:** [dwyl/english-words](https://github.com/dwyl/english-words)
* **Anime/Visual:** [cagliostrolab/860k-ordered-tags-json](https://huggingface.co/datasets/cagliostrolab/860k-ordered-tags-json) (Danbooru tags)
### 📂 Directory Structure & Usage
The embeddings are split into two versions based on reliability:
| Path | Contents | Recommendation |
| :--- | :--- | :--- |
| `root/` | `vocab_hybrid_matrix.pt` | **Base Use:** Stable and recommended for general tasks. |
| `DANANDENG/` | `vocab_hybrid_meta.pt` | **Experimental:** Hybrid Anime + English merge. |
> [!WARNING]
> **Experimental Warning:** The Anime tags in the `DANANDENG` directory are prone to hallucinations regarding specific characters. Use the root files for production/standard tagging and reserve the `DANANDENG` files for experimental research.
* **Model:** Qwen/Qwen3-VL-Embedding-2B
* **Format:** PyTorch Tensor (.pt) + JSON Metadata
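A cache in this format could be loaded along the following lines. This is a hedged sketch rather than the project's loader: `load_cache` is a hypothetical helper, the JSON metadata path is an optional argument because this card does not name a JSON file, and the structure of the loaded object (plain tensor versus a dict of tensors) is not documented here.

```python
import json
from pathlib import Path

import torch


def load_cache(tensor_path, metadata_path=None):
    """Load a cached .pt object on CPU, plus optional JSON metadata.

    Hypothetical helper: the file layout is an assumption, not a documented API.
    """
    # weights_only=True limits unpickling to tensors and plain containers,
    # which is a safer default for files downloaded from a hub.
    obj = torch.load(Path(tensor_path), map_location="cpu", weights_only=True)

    metadata = None
    if metadata_path is not None:
        with open(metadata_path, "r", encoding="utf-8") as f:
            metadata = json.load(f)
    return obj, metadata
```

For instance, `obj, meta = load_cache("vocab_hybrid_matrix.pt")` would load the root-level file listed in the table above; inspecting `type(obj)` (and `obj.shape` if it is a tensor) is a sensible first step before passing it to any search code.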