huggingkot commited on
Commit ·
ea8268b
1
Parent(s): 84cf48a
add files
Browse files- .gitattributes +1 -0
- README.md +17 -0
- config.json +3 -0
- generation_config.json +3 -0
- model.safetensors +3 -0
- special_tokens_map.json +3 -0
- tokenizer.json +3 -0
- tokenizer.model +3 -0
- tokenizer_config.json +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
*.json filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,17 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
---
|
| 3 |
+
base_model:
|
| 4 |
+
- icefog72/IceNalyvkaRP-7b
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+
This is a converted weight from [IceNalyvkaRP-7b](https://huggingface.co/icefog72/IceNalyvkaRP-7b) model in [unsloth 4-bit dynamic quant](https://archive.is/EFz7P) using this [collab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
|
| 8 |
+
|
| 9 |
+
## About this Conversion
|
| 10 |
+
|
| 11 |
+
This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
|
| 12 |
+
|
| 13 |
+
### How 4-bit Quantization Works
|
| 14 |
+
- The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which works under **Torch** via **AutoGPTQ** or **BitsAndBytes**.
|
| 15 |
+
- **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
|
| 16 |
+
|
| 17 |
+
This allows for reduced memory usage and faster inference while keeping the model compact.
|
config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1065b7b3bb76bfa0b2ec42e3b33bad8f323883e1ece2a48edc657a5f4987603a
|
| 3 |
+
size 1123
|
generation_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e73febd0ac1bfe7f16044903b7985ffe94bc8cbd192f60e4728b3a78889e4c87
|
| 3 |
+
size 155
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:45be6c43cfe5df0804476d0e070c6f985c178d025c1970d95e02e997d8342c5f
|
| 3 |
+
size 4125687613
|
special_tokens_map.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:118c1e072780eb6630f0b3453f4872b7c2ff6fbb3f85557663c12a53568d903c
|
| 3 |
+
size 511
|
tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1d5b9634c23bdd4540f633d327120e1fa4b57a2a723a56d7a92debfc4d15c061
|
| 3 |
+
size 3505751
|
tokenizer.model
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dadfd56d766715c61d2ef780a525ab43b8e6da4de6865bda3d95fdef5e134055
|
| 3 |
+
size 493443
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9b5db504189a506056619afab34b18d59c3ebd530d2f6d71ed1d5088b34c7957
|
| 3 |
+
size 1065
|