English
glove
lora
distillation
bpe
cl100k_base
jsanzolac's picture
Upload README.md with huggingface_hub
ddcb77d verified
---
language:
- en
license: apache-2.0
tags:
- glove
- lora
- distillation
- bpe
- cl100k_base
base_model: jsanzolac/bpe_glove_512
datasets:
- jsanzolac/qwen3_emb_512
- jsanzolac/qwen3_emb_512_packed
---
# bpe_glove_512_lora_v1
LoRA drifts on top of frozen `jsanzolac/bpe_glove_512` BPE-GloVe-512 embeddings, distilled
from `Qwen/Qwen3-Embedding-8B` (MRL-truncated to 512 dims).
**Variant 1 loss:** `位_c路InfoNCE + 位_D路||蟻_T - 蟻_S||_F虏` with `位_c=1.0`, `位_D=0.1`.
Each `rank_<r>/` folder contains:
- `checkpoint_final.pt`
- `config.json`
- `vectors_drifted.txt`
- `train_log.jsonl`
Ranks shipped: [512, 256, 128, 64, 32, 16, 8, 4, 2]