--- license: apache-2.0 --- # litert-community/gemma-4-E2B-it-litert-lm Main Model Card: [google/gemma-4-E2B-it](https://huggingface.co/google/gemma-4-E2B-it) ## Try Gemma 4 E2B
| Backend | Quantization scheme | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (sec) | Model size (MB) | CPU Memory (RSS in MB) | |
|---|---|---|---|---|---|---|---|
CPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
|
GPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
| Backend | Quantization scheme | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (sec) | Model size (MB) | CPU Memory (RSS in MB) | |
|---|---|---|---|---|---|---|---|
CPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
|
GPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
| Backend | Quantization scheme | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (sec) | Model size (MB) | CPU Memory (RSS in MB) | |
|---|---|---|---|---|---|---|---|
CPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
|
GPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
| Backend | Quantization scheme | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (sec) | Model size (MB) | CPU Memory (RSS in MB) | |
|---|---|---|---|---|---|---|---|
CPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
|
GPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
| Backend | Quantization scheme | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (sec) | Model size (MB) | CPU Memory (RSS in MB) | |
|---|---|---|---|---|---|---|---|
CPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
|
GPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
| Backend | Quantization scheme | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (sec) | Model size (MB) | CPU Memory (RSS in MB) | |
|---|---|---|---|---|---|---|---|
CPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |
| Backend | Quantization scheme | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (sec) | Model size (MB) | CPU Memory (RSS in MB) | |
|---|---|---|---|---|---|---|---|
CPU |
TODO |
TODO |
TODO |
TODO |
TODO |
TODO |