noctrex commited on
Commit
08bd4a3
·
verified ·
1 Parent(s): 175835a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -5,8 +5,7 @@ base_model:
5
  ---
6
  This is a MXFP4_MOE imatrix quantization of the model [NVIDIA-Nemotron-3-Nano-30B-A3B](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16), based on the imatrix from unsloth.
7
 
8
- As this is not yet supported in the mainline llama.cpp yet, you'll need to compile a special merge of it from here in order to run it:
9
- https://github.com/ggml-org/llama.cpp/pull/18058
10
 
11
  Also see the instructions here: [Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide](https://docs.unsloth.ai/models/nemotron-3)
12
 
 
5
  ---
6
  This is a MXFP4_MOE imatrix quantization of the model [NVIDIA-Nemotron-3-Nano-30B-A3B](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16), based on the imatrix from unsloth.
7
 
8
+ Get the latest [llama.cpp](https://github.com/ggml-org/llama.cpp/releases) in order to run it.
 
9
 
10
  Also see the instructions here: [Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide](https://docs.unsloth.ai/models/nemotron-3)
11