smoothquant: fix istupakov int8 size note comparing output to itself 72a6e63 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
smoothquant: honor --alpha (work around neural-compressor dropping SmoothQuant* extra_options) 422d652 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
quantize-int8-smoothquant: use one fixed append-mode log file 83ec1af thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
quantize-int8-smoothquant: route output through loguru + log to file fa7c830 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
scripts: move quantize/shard tools into scripts/ subfolder 7d8c10c thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
smoothquant: point wer-quants.py suggestion at the parakeet_web repo 9d32c36 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
smoothquant: strip orphaned folded smooth-scale initializers fca240e thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
quantize-fp16: make it a uv run script (PEP 723 header) c95254b thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
docs: reflect standalone script defaults (./ and ./calibration_audio) acfae0f thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
scripts: default quantize-fp16 and shard-fp32 to the current directory 5954ff0 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
smoothquant: normalize usage examples to standalone invocation 0c0c7af thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
smoothquant: default to . for models and ./candidates, ./calibration_audio for calib 6a6bdd7 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
smoothquant: default alpha to per-layer auto search f178395 thiswillbeyourgithub Claude Opus 4.8 commited on 12 days ago
docs: fill in generalization table with comprehensive benchmark numbers aefff72 thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
Add browser-friendly sharded fp32 encoder + shard-fp32.py d99636b thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
docs: expand fp16 intro (naive cast, equal to fp32, half-size packaging win) 57efb46 thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
docs: merge fp16/fp32 WER column (identical values) with italic note bff6277 thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
docs: correct calibration framing (bilingual audio used) + TODO recomputed generalization values 0b443ef thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
docs: note murmure contribution / upstreaming in README 4fdef44 thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
Add .gitignore for calibration audio and local logs 3b3d35d thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
Recalibrate int8 encoder on held-out speeches (fix eval contamination) 8696232 thiswillbeyourgithub Claude Opus 4.8 commited on 15 days ago
docs: language-agnostic note + FLEURS-fr check + parakeet_web origin cb26dd6 thiswillbeyourgithub commited on 15 days ago
docs: frame the long-audio result as not-reproduced, not a guarantee 981c55e thiswillbeyourgithub commited on 15 days ago
Parakeet TDT 0.6B v3 (Multilingual) ONNX with a SmoothQuant int8 encoder d48fd91 thiswillbeyourgithub commited on 16 days ago