---
license: gemma
base_model:
  - Jiunsong/supergemma4-26b-abliterated-multimodal
language:
  - en
  - ko
tags:
  - GGUF
  - llama.cpp
  - gemma4
  - quantization
  - turboquant
  - tq3_4s
  - multimodal
  - abliterated
  - conversational
pipeline_tag: image-text-to-text
---

![thumbnail](thumbnail.png)

# Gemma4-26b-Super-Abliterated-TQ3_4S

`Gemma4-26b-Super-Abliterated-TQ3_4S` is a GGUF quantization of `Jiunsong/supergemma4-26b-abliterated-multimodal` using **TQ3_4S**, a 4.0 bpw Walsh-Hadamard-transform weight format with four per-8 scales.

## Files

* `Gemma4-26b-Super-Abliterated-TQ3_4S.gguf` (12.0 GiB)
* `mmproj-f16.gguf` (1.2 GiB)
* `chat_template.jinja`

## Runtime Requirement

This model requires the public TurboQuant runtime fork:
* https://github.com/turbo-tan/llama.cpp-tq3

It will not load correctly on stock `llama.cpp` or other runtimes that do not include `TQ3_4S`.

## Text-Only Run

```bash
./build/bin/llama-server \
  -m /path/to/Gemma4-26b-Super-Abliterated-TQ3_4S.gguf \
  -ngl 99 -c 4096 -np 1 \
  -ctk q8_0 -ctv q8_0 -fa on \
  --cache-ram 0 --no-warmup --jinja \
  --chat-template-file /path/to/chat_template.jinja
```

## Vision / Image Input

```bash
./build/bin/llama-server \
  -m /path/to/Gemma4-26b-Super-Abliterated-TQ3_4S.gguf \
  -mm /path/to/mmproj-f16.gguf \
  -ngl 99 -c 4096 -np 1 \
  -ctk q8_0 -ctv q8_0 -fa on \
  --cache-ram 0 --no-warmup --jinja \
  --chat-template-file /path/to/chat_template.jinja \
  --no-mmproj-offload
```

## Performance (RTX 5060 Ti 16GB)

| Metric | Value |
|--------|------:|
| PP512 | 2154 tok/s |
| TG128 | 91.3 tok/s |
| Size | 12.01 GiB |
| BPW | 4.09 |

## Quality

10/10 correct on standard QA benchmark (capital of France, 2+2, Python reverse string, gravity, WW2, primes, boiling point, Shakespeare, Jupiter, hello→Hola).

## Base Model

* [`Jiunsong/supergemma4-26b-abliterated-multimodal`](https://huggingface.co/Jiunsong/supergemma4-26b-abliterated-multimodal) — abliterated multimodal variant of Google Gemma 4 26B-A4B
* Original: [`google/gemma-4-26B-A4B-it`](https://huggingface.co/google/gemma-4-26B-A4B-it)

## Credits

* **Jiunsong** for the SuperGemma4 abliterated multimodal model
* **Google DeepMind** for Gemma 4
* **huihui-ai** for the abliteration technique

## License

Same license terms as the base model apply (Gemma license).