🌟Qwopus3.5-27B-v3.5-INT4-FOEM

This is an unofficial quantized version of Qwopus3.5-27B-v3.5.

🧠 Quantization Framework

GPTQModel

🗺️ Quantization Method

FOEM (AAAI 2026)

FOEM is an improved quantization method over GPTQ. The resulting model preserves the same inference structure as GPTQ, ensuring compatibility with existing deployment pipelines while achieving better accuracy.

📚 Calibration Dataset

We randomly sampled 512 examples from nohurry/Opus-4.6-Reasoning-3000x-filtered.

📋 Usage Example

This model can be deployed using standard frameworks such as vLLM, just like other GPTQModel-quantized models.

Example evaluation command:

lm-eval --model vllm --model_args pretrained=models/gptqmodel/Qwopus3.5-27B-v3.5-INT4-FOEM,tensor_parallel_size=1,gpu_memory_utilization=0.45 --tasks wikitext --batch_size 1

⚠️ Limitations & Intended Use

(Adapted from the original repository of Jackrong/Qwopus3.5-27B-v3.5)

  • Possible overfitting if scaling exceeds optimal regime
  • Reasoning may still exhibit instability in edge cases
  • Tool-calling performance depends on environment integration
  • Not all capabilities are fully benchmarked yet

🙏 Acknowledgements

Special thanks to Jackrong for providing the original model: Qwopus3.5-27B-v3.5.

📖 Citation

If you use this model in your research or projects, please cite:

@misc{jackrong_qwopus35_v35,
  title        = {Qwopus3.5-27B-v3.5},
  author       = {Jackrong},
  year         = {2026},
  publisher    = {Hugging Face}
}
@misc{qubitium2024gptqmodel,
  author = {ModelCloud.ai and qubitium@modelcloud.ai},
  title = {GPT-QModel},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/modelcloud/gptqmodel}},
  note = {Contact: qubitium@modelcloud.ai},
  year = {2024},
}
@inproceedings{zheng2026first,
  title={First-order error matters: Accurate compensation for quantized large language models},
  author={Zheng, Xingyu and Qin, Haotong and Li, Yuye and Chu, Haoran and Wang, Jiakai and Guo, Jinyang and Magno, Michele and Liu, Xianglong},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={40},
  number={34},
  pages={28883--28891},
  year={2026}
}
Downloads last month
41
Safetensors
Model size
27B params
Tensor type
BF16
·
I32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Xingyu-Zheng/Qwopus3.5-27B-v3.5-INT4-FOEM

Base model

Qwen/Qwen3.5-27B
Quantized
(5)
this model

Dataset used to train Xingyu-Zheng/Qwopus3.5-27B-v3.5-INT4-FOEM

Collection including Xingyu-Zheng/Qwopus3.5-27B-v3.5-INT4-FOEM

Paper for Xingyu-Zheng/Qwopus3.5-27B-v3.5-INT4-FOEM