--- license: apache-2.0 base_model: Qwen/Qwen3.5-35B-A3B library_name: transformers tags: - dashq - quantized - post-training-quantization --- # Qwen3.5-35B-A3B-DASHQ-INT3-g128 This repository contains a DASH-Q packed quantized checkpoint for `Qwen/Qwen3.5-35B-A3B`. DASH-Q checkpoints require the lightweight DASH-Q runtime package for loading. They are not plain Transformers checkpoints because linear layers are stored as `PackedQuantizedLinear` modules. ## Install ```bash pip install git+https://github.com/JaeminK/dashq.git ``` ## Load ```python from dashq import load_quantized model, tokenizer = load_quantized( "jkim96/Qwen3.5-35B-A3B-DASHQ-INT3-g128", device_map="auto", ) ``` ## Quantization | Field | Value | | --- | --- | | Base model | `Qwen/Qwen3.5-35B-A3B` | | Bits | `3` | | Group size | `128` | | Scale/zero dtype | `float16` | | Calibration dataset | `wikitext2` | | Calibration samples | `128` | | Sequence length | `2048` | | Original size | `71.9039 GB` | | Quantized size | `17.4800 GB` | ## Evaluation | Metric | Value | | --- | ---: | | `wikitext2_ppl` | 7.1423 | | `zero-shot accuracy avg` | 70.2603 | | `arc_challenge` | 61.3481 | | `arc_easy` | 81.6498 | | `commonsense_qa` | 84.1114 | | `gsm8k_cot` | 82.8658 | | `hellaswag` | 80.3625 | | `lambada_openai` | 69.8040 | | `mmlu` | 77.9590 | | `openbookqa` | 44.0000 | | `piqa` | 82.2089 | | `truthfulqa_mc2` | 55.1401 | | `winogrande` | 73.7174 |