| --- |
| license: apache-2.0 |
| base_model: |
| - Qwen/Qwen3-VL-Embedding-2B |
| pipeline_tag: feature-extraction |
| tags: |
| - embeddings |
| - pytorch |
| --- |
| |
| # About |
|
|
| This repository contains precomputed embedding matrices (**Feature Extraction** caches) built for **Sigma-Captioner**. Because the vocabulary is already encoded, image-to-text alignment and similarity search can run against the cache instead of re-encoding the entire vocabulary on every run. |
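
As a rough illustration of what searching against a cached matrix can look like, the sketch below ranks vocabulary rows by cosine similarity to a single query embedding. It is not taken from Sigma-Captioner: the helper name `top_k_matches`, the `(V, D)` matrix layout, and the random stand-in data are all assumptions made for the example.

```python
import torch
import torch.nn.functional as F


def top_k_matches(query, vocab_matrix, k=5):
    """Return (scores, row_indices) for the k rows most similar to `query`.

    Hypothetical helper: assumes `vocab_matrix` has shape (V, D) and
    `query` has shape (D,). The real cache layout may differ.
    """
    q = F.normalize(query.float().reshape(1, -1), dim=-1)  # (1, D), unit length
    m = F.normalize(vocab_matrix.float(), dim=-1)          # (V, D), unit-length rows
    scores = (q @ m.T).squeeze(0)                          # (V,) cosine similarities
    k = min(k, scores.numel())
    return torch.topk(scores, k)


# Stand-in data only: random vectors replace the real cache and image embedding.
torch.manual_seed(0)
vocab_matrix = torch.randn(1000, 64)
query = vocab_matrix[42] + 0.01 * torch.randn(64)  # a slightly noisy copy of row 42

scores, indices = top_k_matches(query, vocab_matrix, k=3)
print(indices.tolist(), scores.tolist())
```

Normalizing both sides turns the matrix product into cosine similarity, so one multiplication scores the whole vocabulary without calling the encoder again.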
|
|
| The files are compatible with any project, but they are recommended for use with [Sigma-Captioner (SGLANG branch)](https://github.com/uninterruptedpowersupply3-NEW/Sigma-Captioner/tree/SGLANG). |
|
|
| ### Dataset Sources |
| * **English:** [dwyl/english-words](https://github.com/dwyl/english-words) |
| * **Anime/Visual:** [cagliostrolab/860k-ordered-tags-json](https://huggingface.co/datasets/cagliostrolab/860k-ordered-tags-json) (Danbooru tags) |
|
|
| ### Directory Structure & Usage |
| The embeddings are split into two versions based on reliability: |
|
|
| | Path | Contents | Recommendation | |
| | :--- | :--- | :--- | |
| | `root/` | `vocab_hybrid_matrix.pt` | **Base Use:** Stable and recommended for general tasks. | |
| | `DANANDENG/` | `vocab_hybrid_meta.pt` | **Experimental:** Hybrid Anime + English merge. | |
|
|
| > [!WARNING] |
| > **Experimental Warning:** The Anime tags in the `DANANDENG` directory are prone to hallucinations regarding specific characters. Use the root files for production/standard tagging and reserve the `DANANDENG` files for experimental research. |
|
|
| * **Model:** Qwen/Qwen3-VL-Embedding-2B |
| * **Format:** PyTorch Tensor (.pt) + JSON Metadata |
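
A minimal loading sketch follows, under stated assumptions: that the `.pt` file deserializes to a single 2-D tensor with one row per vocabulary entry, and that the JSON metadata is a list of labels in the same row order. Neither layout is confirmed by this card, and the names `load_cache` and `vocab_labels.json` are hypothetical. The demo writes stand-in files to a temporary directory so it runs without downloading anything.

```python
import json
import tempfile
from pathlib import Path

import torch


def load_cache(matrix_path, labels_path=None):
    """Load a cached embedding matrix and, optionally, its labels.

    Hypothetical helper. Assumes the .pt file holds one 2-D tensor and the
    JSON file holds a list with one label per matrix row.
    """
    # weights_only=True restricts unpickling to tensors and basic containers.
    matrix = torch.load(matrix_path, map_location="cpu", weights_only=True)
    if not isinstance(matrix, torch.Tensor) or matrix.dim() != 2:
        raise ValueError("expected a 2-D tensor; the file layout differs from the assumption")

    labels = None
    if labels_path is not None:
        labels = json.loads(Path(labels_path).read_text(encoding="utf-8"))
        if len(labels) != matrix.shape[0]:
            raise ValueError("label count does not match the number of matrix rows")
    return matrix, labels


# Stand-in files only: a tiny random matrix and four made-up labels.
with tempfile.TemporaryDirectory() as tmp:
    matrix_path = Path(tmp) / "vocab_hybrid_matrix.pt"
    labels_path = Path(tmp) / "vocab_labels.json"  # hypothetical file name
    torch.save(torch.randn(4, 8), matrix_path)
    labels_path.write_text(json.dumps(["cat", "dog", "tree", "sky"]), encoding="utf-8")
    matrix, labels = load_cache(matrix_path, labels_path)

print(tuple(matrix.shape), labels)
```

If the real files store a dictionary or a different structure, the shape check raises early instead of failing later during search.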