luisf-mc
/

gui-g2-3b-ccf

@@ -180,7 +180,7 @@ The Phase 4 result in this repo is **the only "ours" finding from a 9-phase proj
 The pattern across all training experiments was the same: the hard splits (icon, web) improved at the cost of the easy splits (text), and overall accuracy never beat the base. The lesson we kept: **at 3B + a few-thousand-sample fine-tuning budget, GUI-G2-3B is near its achievable optimum.** Inference-time wraps like CCF that don't touch the weights win the hard splits without paying the easy-split tax.
-Full project writeup with per-experiment numbers: see the [benchmarks/results.md](https://github.com/luisfmoncer/gui-grounding/blob/main/benchmarks/results.md) in the source repo.
 ## Citation

 The pattern across all training experiments was the same: the hard splits (icon, web) improved at the cost of the easy splits (text), and overall accuracy never beat the base. The lesson we kept: **at 3B + a few-thousand-sample fine-tuning budget, GUI-G2-3B is near its achievable optimum.** Inference-time wraps like CCF that don't touch the weights win the hard splits without paying the easy-split tax.
+Full project writeup with per-experiment numbers: see [benchmarks/results.md](https://github.com/LufeMC/gui-g2-3b-ccf/blob/main/benchmarks/results.md) in the [GitHub repo](https://github.com/LufeMC/gui-g2-3b-ccf).
 ## Citation