GGUF
qwen3.5
openmythos
build-small-hackathon
conversational
educatedlucifer12 jabbatheduck commited on
Commit
58e33b1
·
0 Parent(s):

Duplicate from jabbatheduck/OpenMythos-GGUF

Browse files

Co-authored-by: Jeremy <jabbatheduck@users.noreply.huggingface.co>

.gitattributes ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ OpenMythos-F16.gguf filter=lfs diff=lfs merge=lfs -text
37
+ OpenMythos-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
38
+ OpenMythos-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
39
+ OpenMythos-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
40
+ OpenMythos-Q5_k.gguf filter=lfs diff=lfs merge=lfs -text
41
+ OpenMythos-27B-Q5_K.gguf filter=lfs diff=lfs merge=lfs -text
42
+ OpenMythos-27B-Q4_K.gguf filter=lfs diff=lfs merge=lfs -text
43
+ OpenMythos-27B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
OpenMythos-27B-Q4_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d3c3398ba8b512aa59074efcaa34d7dcabed7381b68a05ae6bcce77e14d7db45
3
+ size 16547399872
OpenMythos-27B-Q5_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9569a64a89a49064cb3d55127453ce810fb860643c4ea467e993177a5becbdf0
3
+ size 19231099072
OpenMythos-27B-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8b225034431666b0e4723d42efe0aae088c54a84e42ea5f9547f19cb29e2593c
3
+ size 22082529472
README.md ADDED
@@ -0,0 +1,136 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - gguf
5
+ - qwen3.5
6
+ - openmythos
7
+ - build-small-hackathon
8
+ datasets:
9
+ - build-small-hackathon/CVE_Vulnerailities_Detailed
10
+ - himanshu17HF/ArvixImport-Filtered-Final
11
+ base_model:
12
+ - build-small-hackathon/OpenMythos
13
+ - Qwen/Qwen3.6-27B
14
+ ---
15
+
16
+ # OpenMythos 27B - GGUF
17
+
18
+ GGUF quantisation of [build-small-hackathon/OpenMythos](https://huggingface.co/build-small-hackathon/OpenMythos),
19
+ a fine-tune of [Qwen3.6-27B](https://huggingface.co/Qwen/Qwen3.6-27B).
20
+
21
+ Converted with `convert_hf_to_gguf.py --no-mtp` from llama.cpp build 9658.
22
+ The fine-tune does not include MTP head weights (dropped during training), so MTP
23
+ is not available in this GGUF.
24
+
25
+ ## Available Quantisations
26
+
27
+ | File | Size | Type |
28
+ |------|------|------|
29
+ | OpenMythos-27B-F16.gguf | 53.8 GB | F16 |
30
+ | OpenMythos-27B-Q5_K.gguf | 18.3 GB | Q5_K_M |
31
+ | OpenMythos-27B-Q4_K.gguf | 15.4 GB | Q4_K_M |
32
+ | OpenMythos-27B-Q6_K.gguf | 21.2 GB | Q6_K |
33
+
34
+ ## Benchmark
35
+
36
+ Evaluated with [SecEval](https://github.com/XuanwuAI/SecEval) (commit 7aef317) on 2189
37
+ multiple-choice security questions. Backend: llama.cpp OpenAI-compatible server, fully
38
+ offloaded to GPU. No chain-of-thought / reasoning enabled (`enable_thinking=false`).
39
+
40
+ Prompt formatted with a system prompt requesting letter-only answers (no explanation).
41
+
42
+ | Set | Model | Score |
43
+ |-----|-------|-------|
44
+ | A | OpenMythos-27B-Q5_K | 1703 / 2189 (77.8%) |
45
+ | B | VulnLLM-R-7B | 1315 / 2189 (60.1%) |
46
+
47
+ ### OpenMythos-27B-Q5_K test parameters
48
+
49
+ - model: `OpenMythos-27B-Q5_K.gguf`
50
+ - inference: `temp=0.2`, `top_p=0.8`, `top_k=20`, `min_p=0.05`, `repeat_penalty=1.02`
51
+ - benchmark script: `/mnt/storage/SecEval-tmp/run_bench.py`
52
+ - output: `seceval-1781809723.json`
53
+ - prompt speed: 282 tok/s | generation speed: 68 tok/s
54
+
55
+ #### Per-topic scores
56
+
57
+ | Topic | Score |
58
+ |-------|-------|
59
+ | PenTest | 84.2% |
60
+ | MemorySafety | 83.3% |
61
+ | WebSecurity | 82.7% |
62
+ | Vulnerability | 77.8% |
63
+ | NetworkSecurity | 77.4% |
64
+ | SoftwareSecurity | 75.0% |
65
+ | ApplicationSecurity | 74.8% |
66
+ | SystemSecurity | 73.6% |
67
+ | Cryptography | 71.4% |
68
+
69
+ ### VulnLLM-R-7B test parameters
70
+
71
+ - model: `VulnLLM-R-7B.Q6_K.gguf`
72
+ - inference: same settings as above
73
+ - output: `seceval-1781811525.json`
74
+ - prompt speed: 148 tok/s | generation speed: 39 tok/s
75
+
76
+ #### Per-topic scores
77
+
78
+ | Topic | Score |
79
+ |-------|-------|
80
+ | PenTest | 70.9% |
81
+ | WebSecurity | 66.4% |
82
+ | Vulnerability | 58.7% |
83
+ | NetworkSecurity | 58.3% |
84
+ | SystemSecurity | 56.4% |
85
+ | SoftwareSecurity | 54.7% |
86
+ | ApplicationSecurity | 54.7% |
87
+ | MemorySafety | 54.2% |
88
+ | Cryptography | 28.6% |
89
+
90
+ Full detailed results are included in this repo: `seceval-1781809723.json` and
91
+ `seceval-1781811525.json`.
92
+
93
+ ## Usage
94
+
95
+ ### llama-server (recommended)
96
+
97
+ ```ini
98
+ [OpenMythos-27B]
99
+ model = /mnt/storage/models/OpenMythos/OpenMythos-27B-Q5_K.gguf
100
+ chat-template-file = /mnt/storage/llama-server/chat_template-v15.jinja
101
+ ctx-size = 65536
102
+ cache-type-k = q8_0
103
+ cache-type-v = q8_0
104
+ cache-prompt = on
105
+ cache-reuse = 2048
106
+ batch-size = 4096
107
+ ubatch-size = 4096
108
+ kv-unified = on
109
+ parallel = 1
110
+ gpu-layers = all
111
+ temp = 0.2
112
+ top-p = 0.8
113
+ top-k = 20
114
+ min-p = 0.05
115
+ presence-penalty = 0.2
116
+ repeat-penalty = 1.02
117
+ spec-type = ngram-mod
118
+ spec-draft-n-max = 5
119
+ reasoning-format = deepseek
120
+ swa-checkpoints = 5
121
+ ```
122
+
123
+ ### llama-cli
124
+
125
+ ```bash
126
+ /mnt/storage/llama.cpp/build/bin/llama-cli \
127
+ -m /mnt/storage/models/OpenMythos/OpenMythos-27B-Q5_K.gguf \
128
+ --chat-template-file /mnt/storage/llama-server/chat_template-v15.jinja \
129
+ -c 65536 -b 4096 --ubatch-size 4096 \
130
+ --cache-type-k q8_0 --cache-type-v q8_0 \
131
+ --kv-unified -t 8 -fa \
132
+ --temp 0.2 --top-p 0.8 --top-k 20 --min-p 0.05 \
133
+ --presence-penalty 0.2 --repeat-penalty 1.02 \
134
+ -ngl all \
135
+ -p "Your prompt here"
136
+ ```
seceval-1781809723.json ADDED
The diff for this file is too large to render. See raw diff
 
seceval-1781811525.json ADDED
The diff for this file is too large to render. See raw diff