jsanzolac
/

bpe_glove_300_lora_r300_qwen3_msmarco_ft

Model card Files Files and versions

bpe_glove_300_lora_r300_qwen3_msmarco_ft

Stage 2 supervised fine-tuning of jsanzolac/bpe_glove_300_lora_r300_qwen3 on MS MARCO triplet-50 with InfoNCE-only (no MSE), 1 epoch, batch=256, K=6 mined hard negatives per anchor, τ=0.02.

Trainable: A.weight, B.weight only. E (frozen 300-d cl100k GloVe from jsanzolac/drifting-glove-distilled-r300) is excluded from the saved state dict.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for jsanzolac/bpe_glove_300_lora_r300_qwen3_msmarco_ft

Base model

jsanzolac/drifting-glove-distilled-r300

Adapter

jsanzolac/bpe_glove_300_lora_r300_qwen3

Adapter

(6)

this model

Dataset used to train jsanzolac/bpe_glove_300_lora_r300_qwen3_msmarco_ft