grayarea
/

Mistral-Small-3.2-24B-Instruct-2506-Heretic-v1.2-2

Model card Files Files and versions

This is a decensored version of Mistral-Small-3.2-24B-Instruct-2506, made using Heretic v1.2.0 focusing on zero refusals with low KL divergence.

KL Divergence

Metric	This Model	Original Model
KL divergence	0.0189	0 (by definition)
Refusals	0/108	96/108

Abliteration parameters

Zero refusals with KL divergence of 0.0189
Custom heretic training dataset
Model targetted heretic configuration
Abliterated with MPOA enabled (Magnitude-Preserving Orthogonal Ablation)
Full row renormalization
Winsorization Quantile 0.997

The following are benchmarks for the quantized version of this model

Relative Perplexity

Quant	Filename	PPL ± Error
Q8_0	Mistral-Small-3.2-24B-Instruct-2506-Q8_0.gguf (original baseline)	4.6351 +/- 0.02508
Q8_0	Mistral-Small-3.2-24B-Instruct-2506-Heretic-v1.2-2-Q8_0.gguf	4.6410 +/- 0.02510
Q4_K_M	Mistral-Small-3.2-24B-Instruct-2506-Heretic-v1.2-2-Q4_K_M.gguf	4.7332 +/- 0.02566

Benchmark Comparison

Benchmark	Mistral-Small-3.2-24B-Instruct-2506-Q8_0.gguf	Mistral-Small-3.2-24B-Instruct-2506-Q4_K_M.gguf	Mistral-Small-3.2-24B-Instruct-2506-Heretic-v1.2-2-Q4_K_M.gguf
Perplexity (Wikitext-2)	4.6351	4.7078	4.7332
HellaSwag	83.25%	82.50%	82.50%
Winogrande	79.16%	78.22%	77.90%
ARC-Challenge	55.85%	53.85%	55.18%
MMLU	43.86%	44.25%	43.93%

*Note: MMLU benchmark has moral_scenarios, moral_disputes, business_ethics, professional_law and jurisprudence subjects removed. *

Downloads last month: 6

Safetensors

Model size

24B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for grayarea/Mistral-Small-3.2-24B-Instruct-2506-Heretic-v1.2-2

Base model

mistralai/Mistral-Small-3.1-24B-Base-2503

Finetuned

mistralai/Mistral-Small-3.2-24B-Instruct-2506

Finetuned

unsloth/Mistral-Small-3.2-24B-Instruct-2506

Finetuned

(6)

this model

Quantizations