bpe_glove_300_lora_r300_qwen3_msmarco_ft

Stage 2 supervised fine-tuning of jsanzolac/bpe_glove_300_lora_r300_qwen3 on MS MARCO triplet-50, trained with an InfoNCE loss only (no MSE term) for 1 epoch, with batch size 256, K=6 mined hard negatives per anchor, and temperature τ=0.02.
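As a rough illustration of the objective described above, here is a minimal pure-Python sketch of InfoNCE for a single anchor with one positive and K mined hard negatives at τ=0.02. The function names `cosine` and `info_nce` are hypothetical, and the use of cosine similarity and the omission of in-batch negatives are assumptions; the model card does not specify either.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def info_nce(anchor, positive, negatives, tau=0.02):
    """InfoNCE for one anchor: -log softmax of the positive's logit.

    Assumption: only the mined hard negatives appear in the denominator;
    a real training run may also add in-batch negatives.
    """
    logits = [cosine(anchor, positive) / tau]
    logits += [cosine(anchor, n) / tau for n in negatives]
    m = max(logits)  # subtract the max before exponentiating, for stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[0]

# Toy 2-d example with K=6 negatives orthogonal to the anchor.
anchor = [1.0, 0.0]
easy_loss = info_nce(anchor, [1.0, 0.0], [[0.0, 1.0]] * 6)
hard_loss = info_nce(anchor, [0.0, 1.0], [[1.0, 0.0]] * 6)
```

With τ=0.02, a cosine gap of 1.0 becomes a logit gap of 50, so the loss is close to zero when the positive clearly wins and large when a hard negative outranks it. This is why a small temperature puts most of the gradient on the hardest negatives.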

Trainable parameters: A.weight and B.weight only. E (the frozen 300-d cl100k GloVe embedding table from jsanzolac/drifting-glove-distilled-r300) is excluded from the saved state dict.
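One way to picture the checkpoint layout is the sketch below, which drops frozen entries from a state-dict-like mapping before saving. The helper `trainable_state` and the key name `E.weight` are hypothetical, inferred from the parameter names above; the actual saving code is not shown in this card.

```python
def trainable_state(state_dict, frozen_prefixes=("E.",)):
    """Return a copy of state_dict without entries under frozen prefixes.

    Assumption: the frozen GloVe table is stored under keys starting
    with "E." (for example "E.weight").
    """
    return {
        name: value
        for name, value in state_dict.items()
        if not name.startswith(frozen_prefixes)
    }

# Placeholder values stand in for tensors; any mapping with string keys works.
full_state = {"E.weight": "frozen", "A.weight": "lora_A", "B.weight": "lora_B"}
saved_state = trainable_state(full_state)
```

Under this assumption, a checkpoint saved this way contains only A.weight and B.weight, so E would need to be obtained separately from jsanzolac/drifting-glove-distilled-r300 before running the model.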

