latent-variable
/

Qwen3.5-122B-A10B-oQ4

4-bit precision

Model card Files Files and versions

Qwen3.5-122B-A10B-oQ4

This model was quantized using oQ mixed-precision quantization.

Quantization details

Model type: qwen3_5_moe
Bits: 4
Group size: 64
Format: MLX safetensors

Downloads last month: 59

Safetensors

Model size

20B params

Tensor type

U8

·

U32

·

BF16

·

MLX

Hardware compatibility

Log In to add your hardware

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support