All quants updated. Please redownload for best experience. Start from my reccomended settings and Q4_K_P quant.

#2
by LuffyTheFox - opened

Finally found a way to fix ssm_conv1d tensor drift in quantized GGUF models on binary level.

Awesome! Out of curiosity, can you explain in layman's terms what this changes? Thank you so much <3

Awesome! Out of curiosity, can you explain in layman's terms what this changes? Thank you so much <3

I turned down the volume on three over-excited neurons (tensors) with perfectly calculated alpha. Now the model can remember what you said 20 minutes ago instead of getting amnesia.

Layman's term: now the orchestra plays in harmony for the entire symphony.

Sign up or log in to comment