smoothquant: fix istupakov int8 size note comparing output to itself 72a6e63 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
smoothquant: honor --alpha (work around neural-compressor dropping SmoothQuant* extra_options) 422d652 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
quantize-int8-smoothquant: use one fixed append-mode log file 83ec1af thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
quantize-int8-smoothquant: route output through loguru + log to file fa7c830 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
scripts: move quantize/shard tools into scripts/ subfolder 7d8c10c thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago