inferencerlabs commited on
Commit
44b5838
·
verified ·
1 Parent(s): a9d7bb3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -20,6 +20,8 @@ pipeline_tag: text-generation
20
  In this build, the 4-bit pre-quantized weights of the base model were repacked (rather than dequantized and re-quantized to 9-bit), as this approach performed slightly better in our initial coding tests. All remaining weights were quantized to 9-bit. It also includes a temporary chat template. Stay tuned for updates.
21
  </p>
22
 
 
 
23
  ##### Quantized with a modified version of [MLX](https://github.com/ml-explore/mlx)
24
  ##### For more details see our [demonstration videos](https://youtube.com/xcreate) or visit [DeepSeek-V4-Flash](https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash).
25
 
 
20
  In this build, the 4-bit pre-quantized weights of the base model were repacked (rather than dequantized and re-quantized to 9-bit), as this approach performed slightly better in our initial coding tests. All remaining weights were quantized to 9-bit. It also includes a temporary chat template. Stay tuned for updates.
21
  </p>
22
 
23
+ ![Screenshot 2026-04-27 at 2.38.31 pm](https://cdn-uploads.huggingface.co/production/uploads/688479d616f1ec82fa645019/ueGDAcsebpcWYhuU9Gvn4.jpeg)
24
+
25
  ##### Quantized with a modified version of [MLX](https://github.com/ml-explore/mlx)
26
  ##### For more details see our [demonstration videos](https://youtube.com/xcreate) or visit [DeepSeek-V4-Flash](https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash).
27