luisf-mc commited on
Commit
0abaa87
·
verified ·
1 Parent(s): e97c820

Fix benchmarks link: point at the GitHub repo (now public)

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -180,7 +180,7 @@ The Phase 4 result in this repo is **the only "ours" finding from a 9-phase proj
180
 
181
  The pattern across all training experiments was the same: the hard splits (icon, web) improved at the cost of the easy splits (text), and overall accuracy never beat the base. The lesson we kept: **at 3B + a few-thousand-sample fine-tuning budget, GUI-G2-3B is near its achievable optimum.** Inference-time wraps like CCF that don't touch the weights win the hard splits without paying the easy-split tax.
182
 
183
- Full project writeup with per-experiment numbers: see the [benchmarks/results.md](https://github.com/luisfmoncer/gui-grounding/blob/main/benchmarks/results.md) in the source repo.
184
 
185
  ## Citation
186
 
 
180
 
181
  The pattern across all training experiments was the same: the hard splits (icon, web) improved at the cost of the easy splits (text), and overall accuracy never beat the base. The lesson we kept: **at 3B + a few-thousand-sample fine-tuning budget, GUI-G2-3B is near its achievable optimum.** Inference-time wraps like CCF that don't touch the weights win the hard splits without paying the easy-split tax.
182
 
183
+ Full project writeup with per-experiment numbers: see [benchmarks/results.md](https://github.com/LufeMC/gui-g2-3b-ccf/blob/main/benchmarks/results.md) in the [GitHub repo](https://github.com/LufeMC/gui-g2-3b-ccf).
184
 
185
  ## Citation
186