Commit History

smoothquant: fix istupakov int8 size note comparing output to itself
72a6e63

thiswillbeyourgithub Claude Opus 4.8 commited on

smoothquant: honor --alpha (work around neural-compressor dropping SmoothQuant* extra_options)
422d652

thiswillbeyourgithub Claude Opus 4.8 commited on

quantize-int8-smoothquant: use one fixed append-mode log file
83ec1af

thiswillbeyourgithub Claude Opus 4.8 commited on

quantize-int8-smoothquant: route output through loguru + log to file
fa7c830

thiswillbeyourgithub Claude Opus 4.8 commited on

scripts: move quantize/shard tools into scripts/ subfolder
7d8c10c

thiswillbeyourgithub Claude Opus 4.8 commited on

smoothquant: point wer-quants.py suggestion at the parakeet_web repo
9d32c36

thiswillbeyourgithub Claude Opus 4.8 commited on

smoothquant: strip orphaned folded smooth-scale initializers
fca240e

thiswillbeyourgithub Claude Opus 4.8 commited on

quantize-fp16: make it a uv run script (PEP 723 header)
c95254b

thiswillbeyourgithub Claude Opus 4.8 commited on

docs: reflect standalone script defaults (./ and ./calibration_audio)
acfae0f

thiswillbeyourgithub Claude Opus 4.8 commited on

scripts: default quantize-fp16 and shard-fp32 to the current directory
5954ff0

thiswillbeyourgithub Claude Opus 4.8 commited on

smoothquant: normalize usage examples to standalone invocation
0c0c7af

thiswillbeyourgithub Claude Opus 4.8 commited on

smoothquant: default to . for models and ./candidates, ./calibration_audio for calib
6a6bdd7

thiswillbeyourgithub Claude Opus 4.8 commited on

smoothquant: default alpha to per-layer auto search
f178395

thiswillbeyourgithub Claude Opus 4.8 commited on

docs: fill in generalization table with comprehensive benchmark numbers
aefff72

thiswillbeyourgithub Claude Opus 4.8 commited on

Add browser-friendly sharded fp32 encoder + shard-fp32.py
d99636b

thiswillbeyourgithub Claude Opus 4.8 commited on

docs: expand fp16 intro (naive cast, equal to fp32, half-size packaging win)
57efb46

thiswillbeyourgithub Claude Opus 4.8 commited on

docs: merge fp16/fp32 WER column (identical values) with italic note
bff6277

thiswillbeyourgithub Claude Opus 4.8 commited on

docs: correct calibration framing (bilingual audio used) + TODO recomputed generalization values
0b443ef

thiswillbeyourgithub Claude Opus 4.8 commited on

docs: note murmure contribution / upstreaming in README
4fdef44

thiswillbeyourgithub Claude Opus 4.8 commited on

Add .gitignore for calibration audio and local logs
3b3d35d

thiswillbeyourgithub Claude Opus 4.8 commited on

Recalibrate int8 encoder on held-out speeches (fix eval contamination)
8696232

thiswillbeyourgithub Claude Opus 4.8 commited on

docs: language-agnostic note + FLEURS-fr check + parakeet_web origin
cb26dd6

thiswillbeyourgithub commited on

docs: frame the long-audio result as not-reproduced, not a guarantee
981c55e

thiswillbeyourgithub commited on

Parakeet TDT 0.6B v3 (Multilingual) ONNX with a SmoothQuant int8 encoder
d48fd91

thiswillbeyourgithub commited on