Qwen3-8B-LQA-3e-GRPO-100s-save2 / model-00004-of-00004.safetensors

Commit History

(Trained with Unsloth)
3d19608
verified

lmq1909 commited on