---
base_model: TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2
language: en
pipeline_tag: image-text-to-text
library_name: mlx
tags:
  - mlx
  - text-generation-inference
  - transformers
  - unsloth
  - gemma4
  - reasoning
license: apache-2.0
datasets:
  - TeichAI/Claude-Opus-4.6-Reasoning-887x
  - TeichAI/claude-4.5-opus-high-reasoning-250x
  - Crownelius/Opus-4.6-Reasoning-2100x-formatted
---
# 🦆 zecanard/gemma-4-26B-A4B-it-Claude-Opus-Distilled-v2-MLX-3bit-affine

[This model](https://huggingface.co/zecanard/gemma-4-26B-A4B-it-Claude-Opus-Distilled-v2-MLX-3bit-affine) was converted to MLX from [`TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2`](https://huggingface.co/TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2) using `mlx-vlm` version **0.4.4**.
Please refer to the [original model card](https://huggingface.co/TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2) for more details.

## 🌟 Quality

Quantized vision language model with an effective **4.364 bits per weight**.

`mlx_vlm.convert --quantize --q-bits 3 --q-group-size 32 --q-mode affine`

## 🛠️ Customizations

This quant is aware of the current date, and also enables thinking (if available). You may disable this behavior by deleting the following line from the chat template:

`{%- set enable_thinking = true %}`

You may also need to adjust your environment’s **Reasoning Section Parsing** to recognize `<|channel>thought` as the **Start String**, and `<channel|>` as the **End String**.

## 🖥️ Use with `mlx`

```bash
pip install -U mlx-vlm
```

```bash
mlx_vlm.generate --model zecanard/gemma-4-26B-A4B-it-Claude-Opus-Distilled-v2-MLX-3bit-affine --max-tokens 100 --temperature 0 --prompt "Describe this image." --image <path_to_image>
```