Following on my experiments with restoring pruned models, this is Gemma 4 26B A4B pruned down from 128 to 64 experts with a significantly more effective pruning strategy than the previous attempt, and then given GaLore based full parameter training on only the LIMO and LIMA datasets.

The model has proven to be remarkably resilient to pruning, and while it failed in most complex tasks post prune, the training has repaired its ability to roleplay, follow instructions and logical reasoning. However a major caveat of this training run being so short is that the model struggles heavily in math. Asking a complex arithmetic question may cause gibberish or otherwise nonsensical output at worst, and the model being incorrect or giving up at best.

However the model was rigorously tested on both roleplaying and mathematics at the same time within one prompt, and while it struggles with math aside from basic calculations, it is exceptional at staying in character.

This model should be much higher quality than my previous attempt, but do not rely on it for complex tasks as a pruned model is always susceptible to errors.

The prune was based on a Heretic abliterated version of Gemma 4 26B A4B IT.

Downloads last month
52
GGUF
Model size
14B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

4-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for blascotobasco/Gemma-4-14B-A4B-Heretic-GGUF