Commit dfa6f2b by noctrex (verified) · 1 parent: b736536

Add files using upload-large-folder tool

Files changed (1)
  1. README.md +1 -5
README.md CHANGED
@@ -5,8 +5,4 @@ base_model:
  ---
  This is a MXFP4_MOE imatrix quantization of the model [NVIDIA-Nemotron-3-Nano-30B-A3B](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16), based on the imatrix from unsloth.
 
- This cannot run the normal llama.cpp yet, as the code is not yet included.
-
- You will need to compile it yourself with this merge:
- https://github.com/ggml-org/llama.cpp/pull/18058
- or run it from LM Studio.
+ Get the latest [llama.cpp](https://github.com/ggml-org/llama.cpp/releases) in order to run it.