ybelkada
/

bloom-1b7-8bit

Text Generation

text-generation-inference

8-bit precision

Model card Files Files and versions

ybelkada commited on Apr 12, 2023

Commit

9874680

·

1 Parent(s): 365deed

Update README.md

Files changed (1) hide show

README.md +8 -1

README.md CHANGED Viewed

@@ -89,4 +89,11 @@ Then just call `push_to_hub` method or `save_pretrained` method if you want to s
 model.push_to_hub("{your_username}/bloom-1b7-8bit")
 ```
-That's it!

 model.push_to_hub("{your_username}/bloom-1b7-8bit")
 ```
+That's it!
+## What is inside the model's `state_dict`?
+Inside the state dict of the model (`pytorch_model.bin` file) you have
+- the quantized `int8` weights
+- the quantization statistics in `float16`