caiovicentino1/Qwen3.5-9B-Neo-HLWQ-Q5
Text Generation
qwen3_5_text
hlwq
qwen3.5
quantized
kv-cache-compression
conversational
polarengine
arxiv:2502.02617
arxiv:2603.29078
License: apache-2.0
main
Qwen3.5-9B-Neo-HLWQ-Q5
6.27 GB
1 contributor
History: 15 commits
caiovicentino1
Remove legacy polar_config.json
3355841
verified
2 months ago
.gitattributes
Safe
1.57 kB
Upload folder using huggingface_hub
2 months ago
README.md
Safe
4.13 kB
HLWQ rebrand: title, tags, notice, self-links
2 months ago
chat_template.jinja
Safe
4.05 kB
Upload folder using huggingface_hub
2 months ago
compression.png
Safe
58.6 kB
Upload compression.png with huggingface_hub
2 months ago
config.json
Safe
2.04 kB
fix: quant_method polar -> polarengine for vLLM compatibility
2 months ago
family.png
Safe
44.8 kB
Upload family.png with huggingface_hub
2 months ago
hlwq_config.json
Safe
261 Bytes
Add hlwq_config.json (rename from polar_config.json)
2 months ago
kv_speed.png
Safe
35.5 kB
Upload kv_speed.png with huggingface_hub
2 months ago
model_int4.pt
pickle
Detected Pickle imports (14):
"torchao.quantization.quant_primitives.ZeroPointDomain"
"torch.int32"
"torch.IntStorage"
"torch.bfloat16"
"torch.serialization._get_layout"
"torch.BFloat16Storage"
"collections.OrderedDict"
"torch.device"
"torchao.dtypes.affine_quantized_tensor.AffineQuantizedTensor"
"torchao.dtypes.uintx.tensor_core_tiled_layout.TensorCoreTiledLayout"
"torch._utils._rebuild_tensor_v2"
"torch._tensor._rebuild_from_type_v2"
"torch._utils._rebuild_wrapper_subclass"
"torchao.dtypes.uintx.tensor_core_tiled_layout.TensorCoreTiledAQTTensorImpl"
6.25 GB
xet
Upload model_int4.pt with huggingface_hub
2 months ago
tokenizer.json
Safe
20 MB
xet
Upload folder using huggingface_hub
2 months ago
tokenizer_config.json
Safe
1.17 kB
Upload folder using huggingface_hub
2 months ago
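
The listing above marks model_int4.pt as a pickle file and names the globals it imports. As a rough illustration of how such a list can be derived without executing the file, the sketch below walks the pickle opcode stream with the standard-library pickletools module. It is not the scanner used by the Hub: list_pickle_imports is a hypothetical helper name, and the STACK_GLOBAL handling is a simplifying heuristic. It expects raw pickle bytes, so a checkpoint stored as an archive would first need its embedded pickle extracted, which is not shown here.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set[str]:
    """Return the "module.name" globals a pickle stream references, without unpickling it."""
    imports = set()
    strings = []  # string constants seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are normally the two strings pushed
            # just before this opcode. Heuristic (assumption): strings fetched
            # from the memo are not tracked here.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Small self-contained demonstration on an in-memory pickle.
payload = pickle.dumps(collections.OrderedDict(a=1), protocol=2)
print(sorted(list_pickle_imports(payload)))
```

Because the function only reads opcodes and never calls pickle.load, inspecting an untrusted file this way does not run any of the code it references.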