Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym_aotc_h18p e9fe1e8 verified mlboydaisuke commited on 13 days ago
Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym 032adf2 verified mlboydaisuke commited on 13 days ago
Gemma4-VL: add gemma4_e2b_qat_vl_decode_int4linsym_tbl 3e4aa57 verified mlboydaisuke commited on 13 days ago
E2B QAT gather tables (checkpoint-derived - pair with the QAT bundles) 1ff697a verified mlboydaisuke commited on 14 days ago
E2B official-QAT int4lin tbl bundle (Mac 78.9, iPhone 30.7 tok/s) 01a42dd verified mlboydaisuke commited on 14 days ago
gemma4 e2b int4lin tbl AOT h18p bundle (CoreAIChat Gemma ⚡ download target) 431f19b verified mlboydaisuke commited on 14 days ago
gpu-pipelined: gemma4 int4lin tbl bundle (PLE table as static graph input; M4 Max 77.0, iPhone 30.3 via AOT) 19a329e verified mlboydaisuke commited on 15 days ago
macOS best: int8 fused-kernel core (56.6-59 tok/s) 84433ff verified mlboydaisuke commited on 15 days ago
mmap gather front-end tables (shared: iOS GPU + iOS ANE) 3a30dba verified mlboydaisuke commited on 15 days ago
iOS GPU best: int4-kmeans fused-kernel core (17.7 tok/s) 400a691 verified mlboydaisuke commited on 15 days ago