--- pipeline_tag: text-generation base_model: - nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 --- This is a MXFP4_MOE imatrix quantization of the model [NVIDIA-Nemotron-3-Nano-30B-A3B](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16), based on the imatrix from unsloth. Get the latest [llama.cpp](https://github.com/ggml-org/llama.cpp/releases) in order to run it. Also see the instructions here: [Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide](https://docs.unsloth.ai/models/nemotron-3)