huggingkot commited on
Commit ·
2c65e56
1
Parent(s): e51bb62
add files
Browse files- .gitattributes +1 -0
- README.md +17 -0
- chat_template.jinja +4 -0
- config.json +3 -0
- generation_config.json +3 -0
- model-00001-of-00003.safetensors +3 -0
- model-00002-of-00003.safetensors +3 -0
- model-00003-of-00003.safetensors +3 -0
- model.safetensors.index.json +3 -0
- special_tokens_map.json +3 -0
- tokenizer.json +3 -0
- tokenizer_config.json +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
*.json filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,17 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
---
|
| 3 |
+
base_model:
|
| 4 |
+
- SicariusSicariiStuff/Impish_Magic_24B
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+
This is a converted weight from [Impish_Magic_24B](https://huggingface.co/SicariusSicariiStuff/Impish_Magic_24B) model in [unsloth 4-bit dynamic quant](https://archive.is/EFz7P) using this [collab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
|
| 8 |
+
|
| 9 |
+
## About this Conversion
|
| 10 |
+
|
| 11 |
+
This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
|
| 12 |
+
|
| 13 |
+
### How 4-bit Quantization Works
|
| 14 |
+
- The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which works under **Torch** via **AutoGPTQ** or **BitsAndBytes**.
|
| 15 |
+
- **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
|
| 16 |
+
|
| 17 |
+
This allows for reduced memory usage and faster inference while keeping the model compact.
|
chat_template.jinja
ADDED
|
@@ -0,0 +1,4 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{% for message in messages %}{{'<|im_start|>' + message['role'] + '
|
| 2 |
+
' + message['content'] + '<|im_end|>' + '
|
| 3 |
+
'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant
|
| 4 |
+
' }}{% endif %}
|
config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9e3288aa741b85bfbc1f18f114c48ab0af54709e402d6c469c632e0293cff024
|
| 3 |
+
size 1085
|
generation_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8cfe606e8ae49b779b3821f31478793f9b1ddfe50dd9a0a7b53b2b15df4de510
|
| 3 |
+
size 177
|
model-00001-of-00003.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9a6310a7b283632e429828ac5741e7e5ffd10d15b81d564e3d69549df76bb4e6
|
| 3 |
+
size 4983059570
|
model-00002-of-00003.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:603c11a5aa01772eb3b7ea18cee5c5ba9dae2442c22ba5bf1b9552f44ba2be82
|
| 3 |
+
size 4987974528
|
model-00003-of-00003.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6d30925d1e60ca087fb8df9d8c0b8f8635614ef4f489bdef80ece31231bd3e60
|
| 3 |
+
size 4182407796
|
model.safetensors.index.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:237cc12d59f8228ac471ad1e88ca78c8e577bab8d24a4bfec351e026afd5dabb
|
| 3 |
+
size 165421
|
special_tokens_map.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a0484b1778862c5b616f24d94079bcd645f9ad687ceffc28f0d0af683f762817
|
| 3 |
+
size 21446
|
tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:022babbca3e638f8beea33101ca709791362c00647e462df84a1a4523e633839
|
| 3 |
+
size 17078206
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8679aa42cc662d81b237d993c8a3d74718f623a452745d9fd6f555908c8e6512
|
| 3 |
+
size 198244
|