QAPhi4_GRPO_250220_merged_16bit / model-00008-of-00008.safetensors

Commit History

Trained with Unsloth
8e229ab
verified

YxBxRyXJx commited on