Would you please created a IQ3_XXS for my poor 16G vram?

#1
by hemono - opened

I don't have so much vram to quantize by my RTX 5070ti.
Thank you very much!
https://unsloth.ai/docs/models/qwen3.5/gguf-benchmarks#id-2-imatrix-works-very-well

Can't wait to try it, thanks bro!

barozp changed discussion status to closed
barozp changed discussion status to open

Hi @barozp ,

could you please quantize https://huggingface.co/0xSero/Qwen3.6-28B-REAP20-A3B to IQ3_XXS as well?

Thank you in advance!

Owner

Sign up or log in to comment