UPShf / Vocabulary-Qwen3-VL-Embedding-2B


	---
	license: apache-2.0
	base_model:
	- Qwen/Qwen3-VL-Embedding-2B
	pipeline_tag: feature-extraction
	tags:
	- embeddings
	- pytorch
	---

	# About

	This repository contains precomputed feature-extraction matrices (embeddings) of a text vocabulary, built for Sigma-Captioner. Because the vocabulary is encoded once and cached, image-to-text alignment and similarity search can run at high speed without re-encoding the entire vocabulary on every run.
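As a rough illustration of why a cached matrix helps, the lookup reduces to one matrix product between an image embedding and the precomputed vocabulary matrix. The sketch below uses random stand-in tensors; the function `top_k_words`, the toy vocabulary, and the 8-dimensional embeddings are assumptions made for illustration and are not Sigma-Captioner's actual API.

```python
import torch
import torch.nn.functional as F


def top_k_words(image_emb, vocab_matrix, vocab, k=3):
    """Return the k vocabulary entries most similar to one image embedding.

    image_emb:    tensor of shape (dim,)
    vocab_matrix: tensor of shape (num_words, dim), computed once and cached
    vocab:        list of num_words strings, row-aligned with vocab_matrix
    """
    # Normalise both sides so the dot product equals cosine similarity.
    image_emb = F.normalize(image_emb, dim=-1)
    vocab_matrix = F.normalize(vocab_matrix, dim=-1)
    scores = vocab_matrix @ image_emb  # shape: (num_words,)
    k = min(k, len(vocab))
    values, indices = torch.topk(scores, k)
    return [(vocab[i], v) for i, v in zip(indices.tolist(), values.tolist())]


# Stand-in data (hypothetical): a real run would load the cached matrix instead.
torch.manual_seed(0)
vocab = ["cat", "dog", "tree", "car"]
vocab_matrix = torch.randn(len(vocab), 8)
# Fake "image" embedding that sits very close to the "tree" row.
image_emb = vocab_matrix[2] + 0.01 * torch.randn(8)

results = top_k_words(image_emb, vocab_matrix, vocab, k=2)
print(results)
```

The vocabulary side of the product never changes between images, which is why caching it once pays off: only the image embedding has to be computed per query.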

	The embeddings are compatible with any project, but are recommended for use with [Sigma-Captioner (SGLANG branch)](https://github.com/uninterruptedpowersupply3-NEW/Sigma-Captioner/tree/SGLANG).

	### 📊 Dataset Sources
	* English: [dwyl/english-words](https://github.com/dwyl/english-words)
	* Anime/Visual: [cagliostrolab/860k-ordered-tags-json](https://huggingface.co/datasets/cagliostrolab/860k-ordered-tags-json) (Danbooru tags)

	### 📂 Directory Structure & Usage
	The embeddings are split into two versions based on reliability:

	| Path | Contents | Recommendation |
	| :--- | :--- | :--- |
	| `root/` | `vocab_hybrid_matrix.pt` | Base use: stable and recommended for general tasks. |
	| `DANANDENG/` | `vocab_hybrid_meta.pt` | Experimental: hybrid Anime + English merge. |

	> [!WARNING]
	> The anime tags in the `DANANDENG` directory are prone to hallucinations regarding specific characters. Use the root files for production or standard tagging, and reserve the `DANANDENG` files for experimental research.

	* Model: Qwen/Qwen3-VL-Embedding-2B
	* Format: PyTorch Tensor (.pt) + JSON Metadata
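
A minimal sketch of how a tensor-plus-JSON cache of this kind can be written and read back. The file names `demo_matrix.pt` and `demo_vocab.json`, the JSON layout (a plain list of strings), and the helper `load_cache` are all hypothetical; the actual files in this repository may be organised differently, so inspect them before relying on this.

```python
import json
import tempfile
from pathlib import Path

import torch


def load_cache(matrix_path, meta_path):
    """Load an embedding matrix and its row-aligned vocabulary list."""
    # map_location="cpu" avoids needing the device the cache was written on;
    # weights_only=True restricts unpickling to tensors and basic containers.
    matrix = torch.load(matrix_path, map_location="cpu", weights_only=True)
    with open(meta_path, encoding="utf-8") as fh:
        vocab = json.load(fh)
    if matrix.shape[0] != len(vocab):
        raise ValueError("matrix rows and vocabulary entries are misaligned")
    return matrix, vocab


# Round-trip with synthetic data (hypothetical names and contents).
with tempfile.TemporaryDirectory() as tmp:
    matrix_path = Path(tmp) / "demo_matrix.pt"
    meta_path = Path(tmp) / "demo_vocab.json"
    torch.save(torch.zeros(3, 4), matrix_path)
    meta_path.write_text(json.dumps(["red", "green", "blue"]), encoding="utf-8")
    matrix, vocab = load_cache(matrix_path, meta_path)

print(tuple(matrix.shape), vocab)
```

Checking that the row count matches the vocabulary length is a cheap guard against pairing a matrix with the wrong metadata file.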