The DECKARD's Brain - Gemma 4, 31b - GGUF

GGUF-quantized version of kabachuha/Gemma-4-The-Deckards-Brain-31B, a merge of two heretic Gemma 4 31B dense models with thinking capabilities.

The GGUF-converted native NVFP4 file was obtained through llama.cpp's convert_hf_to_gguf.py from the r/writingprompts calibrated base of https://huggingface.co/kabachuha/Gemma-4-The-Deckards-Brain-31B-NVFP4.

The prompt format is fully inherited from DavidAU's The Deckard Thinking, meaning the chat template will have thinking by default. To disable, override the chat template with chat_template-instruct.jinja.

Thinking is highly recommended to not be ever turned off! If you are role-playing in non-English, this is crucial to bring your language output distribution closer to the English/Japanese training set, otherwise it will be much more bland and more censored!

For the rest of the information, see the original model card.

Downloads last month
550
GGUF
Model size
31B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

4-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kabachuha/Gemma-4-The-Deckards-Brain-31B-GGUF

Quantized
(4)
this model