--- title: TRIAGE GRPO Training V2 emoji: 🏥 colorFrom: green colorTo: blue sdk: docker pinned: false --- # TRIAGE Agent GRPO Training — V2 (Account 2) Training Qwen3.5-27B on expanded medical dataset (V2). **This is the second training run for LoRA adapter merging.** GPU: l40s | Epochs: 3