Mistral-Small-4-119B-TurboQuant-MLX-8bit / model-00021-of-00028.safetensors

Commit History

Add MLX 8-bit quantized model with KV cache compression
556c125
verified

majentik commited on