Melvin56 commited on
Commit
9995d51
·
1 Parent(s): 7ab60a6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -6
README.md CHANGED
@@ -15,16 +15,13 @@ tags:
15
 
16
  Original Model : [huihui-ai/GLM-4-9B-0414-abliterated](https://huggingface.co/huihui-ai/GLM-4-9B-0414-abliterated)
17
 
18
- Llama.cpp build: 1d735c0b (5165)
19
 
20
  I used imatrix to create all these quants using this [Dataset](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
21
 
22
- With Llama.cpp(Ollama and LM Studio were not tested), you'll need to add these [specific commands](https://github.com/ggml-org/llama.cpp/issues/12946) :
23
-
24
  ```
25
- --override-kv glm4.rope.dimension_count=int:64 \
26
- --override-kv tokenizer.ggml.eos_token_id=int:151336 \
27
- --chat-template chatglm4
28
  ```
29
 
30
  | | CPU (AVX2) | CPU (ARM NEON) | Metal | cuBLAS | rocBLAS | SYCL | CLBlast | Vulkan | Kompute |
 
15
 
16
  Original Model : [huihui-ai/GLM-4-9B-0414-abliterated](https://huggingface.co/huihui-ai/GLM-4-9B-0414-abliterated)
17
 
18
+ Llama.cpp build: 558a7647 (5190)
19
 
20
  I used imatrix to create all these quants using this [Dataset](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
21
 
 
 
22
  ```
23
+ Update-01
24
+ * [Fixed Quant] Re-quantized all quants with build: 558a7647 (5190)
 
25
  ```
26
 
27
  | | CPU (AVX2) | CPU (ARM NEON) | Metal | cuBLAS | rocBLAS | SYCL | CLBlast | Vulkan | Kompute |