codelion committed on
Commit 5db2a0a · verified · 1 Parent(s): 319aed1

Add kv_config.json (per-layer mixed-precision KV cache, 5.0 BPW target from OptiQ kv-cache sensitivity analysis)

Files changed (1)
  1. kv_config.json +42 -0
kv_config.json ADDED
@@ -0,0 +1,42 @@
+ [
+   {
+     "layer_idx": 3,
+     "bits": 4,
+     "group_size": 64
+   },
+   {
+     "layer_idx": 7,
+     "bits": 4,
+     "group_size": 64
+   },
+   {
+     "layer_idx": 11,
+     "bits": 4,
+     "group_size": 64
+   },
+   {
+     "layer_idx": 15,
+     "bits": 4,
+     "group_size": 64
+   },
+   {
+     "layer_idx": 19,
+     "bits": 4,
+     "group_size": 64
+   },
+   {
+     "layer_idx": 23,
+     "bits": 4,
+     "group_size": 64
+   },
+   {
+     "layer_idx": 27,
+     "bits": 8,
+     "group_size": 64
+   },
+   {
+     "layer_idx": 31,
+     "bits": 8,
+     "group_size": 64
+   }
+ ]
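
As a sanity check on the 5.0 BPW target named in the commit message, the following minimal Python sketch parses the same eight entries and averages their `bits` values. It assumes the target refers to the plain mean of nominal `bits` over the listed layers, ignoring any per-group scale or offset overhead implied by `group_size`; the commit itself does not state how the figure is computed. The helper names `load_kv_config` and `mean_bits` are hypothetical and are not part of OptiQ or any library.

```python
import json

# Inline copy of the kv_config.json entries added in this commit.
KV_CONFIG_JSON = """
[
  {"layer_idx": 3,  "bits": 4, "group_size": 64},
  {"layer_idx": 7,  "bits": 4, "group_size": 64},
  {"layer_idx": 11, "bits": 4, "group_size": 64},
  {"layer_idx": 15, "bits": 4, "group_size": 64},
  {"layer_idx": 19, "bits": 4, "group_size": 64},
  {"layer_idx": 23, "bits": 4, "group_size": 64},
  {"layer_idx": 27, "bits": 8, "group_size": 64},
  {"layer_idx": 31, "bits": 8, "group_size": 64}
]
"""


def load_kv_config(text):
    """Parse the config text into {layer_idx: (bits, group_size)}."""
    table = {}
    for entry in json.loads(text):
        idx = entry["layer_idx"]
        if idx in table:
            # A repeated layer would make the per-layer lookup ambiguous.
            raise ValueError(f"duplicate layer_idx {idx}")
        table[idx] = (entry["bits"], entry["group_size"])
    return table


def mean_bits(table):
    """Unweighted mean of nominal bits over the listed layers (assumption)."""
    return sum(bits for bits, _ in table.values()) / len(table)


kv_table = load_kv_config(KV_CONFIG_JSON)
print(mean_bits(kv_table))  # (6 * 4 + 2 * 8) / 8 = 5.0
```

Under that assumption the six 4-bit layers and two 8-bit layers average to exactly 5.0 bits, which matches the stated target. A consumer of the file could use the same `kv_table` lookup to pick `bits` and `group_size` for each listed layer.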