Smoffyy committed on
Commit 0427ebf · verified · 1 Parent(s): 07554c3

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -40,7 +40,7 @@ If you have **more than 24GB of VRAM**:
  - Use anything above ``q5_k_m`` if you have **48GB of VRAM** for the **closest to original** quality.

  If you have **8–12GB of VRAM**:
- - Use ``q4_k_m``, and offload MoE layers to CPU, this will vary depending on your system.
+ - Use ``q3_k_m``, and offload MoE layers to CPU, this will vary depending on your system.

  > Using this model at a **Context Size of 32k**, and a quantization of **q4_k_m**, uses around **21GB**. *This is WITHOUT MoE layers offloaded to CPU.*