litert-community
/

gemma-4-E2B-it-litert-lm

Model card Files Files and versions

marissaw commited on Apr 1

Commit

753814e

·

verified ·

1 Parent(s): f85d401

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -28,6 +28,8 @@ All benchmarks were taken using 1024 prefill tokens and 256 decode tokens with a
 CPU memory was measured using, rusage::ru_maxrss on Android, Linux and Raspberry Pi, task_vm_info::phys_footprint on iOS and MacBook and process_memory_counters::PrivateUsage on Windows.
 **Android**
 *Note: On [supported Android devices](https://developers.google.com/ml-kit), Gemma 4 is available through Android AI Core as [Gemini Nano](https://developer.android.com/ai/gemini-nano#architecture), which is the recommended path for production applications.*

 CPU memory was measured using, rusage::ru_maxrss on Android, Linux and Raspberry Pi, task_vm_info::phys_footprint on iOS and MacBook and process_memory_counters::PrivateUsage on Windows.
+We use the Gemma quantization scheme that employs a mixture of 2bit, 4bit and 8bit weights.
 **Android**
 *Note: On [supported Android devices](https://developers.google.com/ml-kit), Gemma 4 is available through Android AI Core as [Gemini Nano](https://developer.android.com/ai/gemini-nano#architecture), which is the recommended path for production applications.*