Qwen2.5-3B-reasoning-medical-symptoms-GRPO-f16 / pytorch_model.bin.index.json

Commit History

Trained with Unsloth
b515bf7
verified

dumbequation committed on