hotdogs commited on
Commit
2541b90
·
verified ·
1 Parent(s): bf6852f

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +94 -0
README.md ADDED
@@ -0,0 +1,94 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ - zh
6
+ - multilingual
7
+ tags:
8
+ - qwen3.5
9
+ - qwen35
10
+ - gguf
11
+ - multimodal
12
+ - vision
13
+ - image-text-to-text
14
+ - abliterated
15
+ - uncensored
16
+ - llama.cpp
17
+ - 4b
18
+ pipeline_tag: image-text-to-text
19
+ base_model:
20
+ - huihui-ai/Huihui-Qwen3.5-4B-Claude-4.6-Opus-abliterated
21
+ - Qwen/Qwen3.5-4B
22
+ library_name: llama.cpp
23
+ ---
24
+
25
+ # Huihui-Qwen3.5-4B-Claude-4.6-Opus-abliterated — GGUF
26
+
27
+ GGUF conversion of [huihui-ai/Huihui-Qwen3.5-4B-Claude-4.6-Opus-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3.5-4B-Claude-4.6-Opus-abliterated) for use with [llama.cpp](https://github.com/ggerganov/llama.cpp).
28
+
29
+ ## Credits
30
+
31
+ | Role | Model / Author |
32
+ |---|---|
33
+ | **Base LLM** | [Qwen/Qwen3.5-4B](https://huggingface.co/Qwen/Qwen3.5-4B) — Alibaba Qwen Team |
34
+ | **Abliterated (uncensored)** | [huihui-ai/Huihui-Qwen3.5-4B-Claude-4.6-Opus-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3.5-4B-Claude-4.6-Opus-abliterated) — Huihui AI |
35
+ | **GGUF Conversion** | [hotdogs](https://huggingface.co/hotdogs) — via [llama.cpp](https://github.com/ggerganov/llama.cpp) |
36
+
37
+ 🙏 Huge thanks to **Qwen Team** (Alibaba) for the base model, **Huihui AI** for the abliteration, and **ggerganov** for llama.cpp!
38
+
39
+ ## Model Details
40
+
41
+ | Spec | Value |
42
+ |---|---|
43
+ | Parameters | ~4B |
44
+ | Architecture | Qwen3.5 Multimodal (QWEN35) |
45
+ | hiddensize | 2560 |
46
+ | Layers | 32 |
47
+ | Attention Heads | 16 (KV: 4) |
48
+ | Context Length | **262,144** (256K tokens) |
49
+ | FFN Intermediate | 9216 |
50
+ | Vision Encoder | 24 layers, hiddensize=1024, patchsize=16 |
51
+ | Modality | **image-text-to-text** 🖼️➡️📝 |
52
+ | Censorship | **Abliterated** (refusal direction removed) |
53
+ | License | Apache 2.0 |
54
+
55
+ ## Available Quantizations
56
+
57
+ | File | Size | BPW | Quality | Recommended For |
58
+ |---|---|---|---|---|
59
+ | huihui-qwen35-4b-BF16.gguf | 7.9 GB | 16.00 | ⭐ Full | Best quality, 16GB+ VRAM |
60
+ | huihui-qwen35-4b-Q8_0.gguf | 4.2 GB | ~8.00 | ⭐ Very High | Balanced, 8GB+ VRAM |
61
+ | huihui-qwen35-4b-Q4_K_M.gguf | 2.6 GB | 5.13 | ⭐ Good | Low VRAM, 6GB+ VRAM |
62
+ | mmproj-huihui-qwen35-4b-BF16.gguf | 645 MB | — | Vision | **Multimodal projector** (required for images) |
63
+
64
+ ## Usage
65
+
66
+ ### Text-only
67
+
68
+ ./llama-cli -m huihui-qwen35-4b-Q4_K_M.gguf -p "Hello!" -n 256
69
+
70
+ ### Multimodal (image + text)
71
+
72
+ ./llama-qwen2vl-cli -m huihui-qwen35-4b-Q4_K_M.gguf --mmproj mmproj-huihui-qwen35-4b-BF16.gguf --image photo.jpg -p "Describe this image"
73
+
74
+ ### Server (OpenAI-compatible API)
75
+
76
+ ./llama-server -m huihui-qwen35-4b-Q4_K_M.gguf --mmproj mmproj-huihui-qwen35-4b-BF16.gguf --host 0.0.0.0 --port 8080
77
+
78
+ ### Python (llama-cpp-python)
79
+
80
+ llm = Llama(model_path="huihui-qwen35-4b-Q4_K_M.gguf", n_ctx=32768)
81
+ output = llm("Hello!", max_tokens=128)
82
+
83
+ ## About Abliteration
84
+
85
+ This model has undergone **directional ablation** — a technique that removes the "refusal direction" from the model's activation space (Arditi et al. 2024). The model will not refuse questions that base Qwen3.5 would normally decline.
86
+
87
+ **Use responsibly.** Ensure your use case complies with applicable laws.
88
+
89
+ ## Conversion Notes
90
+
91
+ - Converted with llama.cpp convert_hf_to_gguf.py
92
+ - BF16 output type
93
+ - QWEN35 architecture, Qwen3VLVisionModel for mmproj
94
+ - Metadata preserved from source model