Skyfall-31B-v4.2-int8

This repository contains a weight-only INT8 quantized version of TheDrummer/Skyfall-31B-v4.2.

Notes:

  • Quantized on Kaggle using CPU + RAM disk (/dev/shm)
  • Quantization backend: Optimum Quanto
  • Intended as an uploaded INT8 artifact; TPU runtime compatibility depends on the serving stack
Downloads last month
11
Safetensors
Model size
31B params
Tensor type
BF16
·
I8
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for vekotov/Skyfall-31B-v4.2-int8