Request for Q4 GGUF version for x86 users

#2
by cltyq777 - opened

Hi Dealignal

Thank you so much for your incredible work on abliterating this hybrid SSM/CoT model! I saw your comments on Reddit mentioning the massive effort it took to get this 397B model working coherently.

I am extremely interested in running this, but my setup is an x86 server (Dual CPU with 512GB RAM) running llama.cpp, so I cannot use the MLX/JANG format.

I remember you mentioned on Reddit that you could make GGUF versions (Q4 or above) if there was enough demand. Could you please consider releasing a Q4_K_M or Q4_0 GGUF version of this CRACK abliterated model? Or alternatively, uploading the unquantized Safetensors so the community can run the conversion scripts?

Thank you again for your dedication to the open-source community!

waiting for GGUF versions too :)

Sign up or log in to comment