---
base_model: TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2
language: en
pipeline_tag: image-text-to-text
library_name: mlx
tags:
- mlx
- text-generation-inference
- transformers
- unsloth
- gemma4
- reasoning
license: apache-2.0
datasets:
- TeichAI/Claude-Opus-4.6-Reasoning-887x
- TeichAI/claude-4.5-opus-high-reasoning-250x
- Crownelius/Opus-4.6-Reasoning-2100x-formatted
---

# 🦆 zecanard/gemma-4-26B-A4B-it-Claude-Opus-Distilled-v2-MLX-3bit-affine

[This model](https://huggingface.co/zecanard/gemma-4-26B-A4B-it-Claude-Opus-Distilled-v2-MLX-3bit-affine) was converted to MLX from [`TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2`](https://huggingface.co/TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2) using `mlx-vlm` version **0.4.4**. Please refer to the [original model card](https://huggingface.co/TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill-v2) for more details.

## 🌟 Quality

Quantized vision language model with an effective **4.364 bits per weight**.

`mlx_vlm.convert --quantize --q-bits 3 --q-group-size 32 --q-mode affine`

## 🛠️ Customizations

This quant is aware of the current date, and also enables thinking (if available). You may disable this behavior by deleting the following line from the chat template:

`{%- set enable_thinking = true %}`

You may also need to adjust your environment’s **Reasoning Section Parsing** to recognize `<|channel>thought` as the **Start String**, and `` as the **End String**.

## 🖥️ Use with `mlx`

```bash
pip install -U mlx-vlm
```

```bash
mlx_vlm.generate --model zecanard/gemma-4-26B-A4B-it-Claude-Opus-Distilled-v2-MLX-3bit-affine --max-tokens 100 --temperature 0 --prompt "Describe this image." --image
```
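
If you post-process the generated text yourself rather than relying on an environment's **Reasoning Section Parsing**, the reasoning section can be separated from the answer with plain string handling. The following is a minimal sketch, not part of `mlx-vlm`: the helper `split_reasoning` is hypothetical, and `END` is a made-up placeholder because the End String is not reproduced here. Substitute the actual End String used by the chat template.

```python
from typing import Tuple


def split_reasoning(text: str, start: str, end: str) -> Tuple[str, str]:
    """Split generated text into (reasoning, answer).

    - No start marker: reasoning is empty and the whole text is the answer.
    - Start marker without end marker (for example, output cut off by a
      token limit): everything after the start marker is reasoning.
    """
    if not start or not end:
        raise ValueError("start and end markers must be non-empty")

    head, found, rest = text.partition(start)
    if not found:
        return "", text.strip()

    reasoning, closed, tail = rest.partition(end)
    if not closed:
        return reasoning.strip(), head.strip()
    return reasoning.strip(), (head + tail).strip()


START = "<|channel>thought"  # Start String named in the Customizations section
END = "<END>"  # ASSUMPTION: hypothetical placeholder, not the real End String

# Hand-written sample text, not real model output.
sample = f"{START}\nThe user wants a caption.{END}A duck on a pond."
reasoning, answer = split_reasoning(sample, START, END)
print("reasoning:", reasoning)
print("answer:", answer)
```

Passing the markers as parameters keeps the helper independent of any one chat template, so only the two constants need to change if the markers differ in your setup.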