Image-to-Text
Transformers
English
gui-grounding
screen-understanding
vision-language-model
icon-detection
screenspot
visual-search
Instructions to use luisf-mc/gui-g2-3b-ccf with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use luisf-mc/gui-g2-3b-ccf with Transformers:
# Use a pipeline as a high-level helper
# Warning: Pipeline type "image-to-text" is no longer supported in transformers v5.
# You must load the model directly (see below) or downgrade to v4.x with:
# pip install "transformers<5.0.0"
from transformers import pipeline

pipe = pipeline("image-to-text", model="luisf-mc/gui-g2-3b-ccf")

# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("luisf-mc/gui-g2-3b-ccf", dtype="auto")
- Notebooks
- Google Colab
- Kaggle
Update live demo perf: ~700ms warm fast (vLLM v0.19.1)
README.md
CHANGED
@@ -23,7 +23,7 @@ metrics:
 A drop-in inference wrapper that improves GUI-G2-3B's icon-grounding accuracy by **+2.2pp on ScreenSpot-v2** at zero training cost. The base weights are unchanged; everything is in the inference pipeline.

 > **Try it live (Azure A100, scale-to-zero)**: <https://guigrounding.whiteplant-27564a0e.eastus.azurecontainerapps.io>
-> Warm latency: **~
+> Warm latency: **~250-400ms server time / ~700-900ms wall time** for fast mode (CCF), **~900ms server time / ~1.6s wall** for accurate mode (6-pass self-consistency with real agreement-based confidence). The playground also streams the coarse CCF prediction at ~600ms wall so the dot appears tentatively before the refined pass completes. Cold start ~90s the first time after idle.
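The changed line mentions a 6-pass self-consistency mode with agreement-based confidence but does not say how agreement is computed. The following is a minimal sketch of one plausible approach, not the wrapper's actual implementation: take the per-axis median of the predicted click points as the consensus and report the fraction of passes that land within a small radius of it. The function name `aggregate_clicks`, the `radius` threshold, and the sample coordinates are all hypothetical.

```python
from math import hypot
from statistics import median


def aggregate_clicks(points, radius=0.05):
    """Combine several predicted click points into one point plus a confidence.

    points: list of (x, y) pairs in normalized [0, 1] screen coordinates,
            one per inference pass (assumed input format).
    radius: hypothetical agreement threshold in the same normalized units.
    """
    if not points:
        raise ValueError("need at least one prediction")

    # Consensus point: per-axis median, which is robust to a few outlier passes.
    cx = median(x for x, _ in points)
    cy = median(y for _, y in points)

    # Confidence: share of passes that fall within `radius` of the consensus.
    agreeing = sum(1 for x, y in points if hypot(x - cx, y - cy) <= radius)
    return (cx, cy), agreeing / len(points)


# Six made-up passes: five cluster on one icon, one lands elsewhere.
passes = [
    (0.41, 0.22), (0.42, 0.22), (0.40, 0.23),
    (0.41, 0.21), (0.43, 0.22), (0.80, 0.75),
]
point, confidence = aggregate_clicks(passes)
print(point, round(confidence, 3))
```

With this scheme a single stray pass lowers the confidence without moving the consensus point much, which is one reason to prefer a median over a mean here.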