Qwen3-8B-LQA-3e-GRPO-60s-save1 / model-00002-of-00004.safetensors

Commit History

(Trained with Unsloth)
fb849d6
verified

lmq1909 committed on