How to use from Ollama:
ollama run hf.co/ikawrakow/qwen-14b-chat-gguf

These are quantized Qwen-14B-Chat models in GGUF format for use with llama.cpp, posted in response to a user request.

Note that the importance matrix used in the quantization was derived from English-only training data, so I have no idea how these models will perform in Chinese.
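To make that concern concrete, here is a toy sketch of importance-weighted quantization. It is not llama.cpp's actual code: the function names, the 4-bit symmetric scheme, the candidate-scale search, and all numbers are hypothetical, chosen only for illustration. The assumption is that an importance matrix assigns each weight a value reflecting how strongly it was exercised by the calibration text, and that the quantizer picks the scale minimizing the importance-weighted error. Importance values gathered from different calibration data can then favor different scales for the same weights.

```python
def quantize_block(weights, importance, bits=4, n_candidates=64):
    """Toy quantizer: pick the scale minimizing importance-weighted squared error."""
    qmax = 2 ** (bits - 1) - 1  # 7 for 4-bit symmetric quantization
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return 1.0, [0] * len(weights)
    best = None
    for k in range(1, n_candidates + 1):
        # Candidate scales between roughly half of and the full max-abs scale.
        scale = (max_abs / qmax) * (0.5 + 0.5 * k / n_candidates)
        q = [max(-qmax - 1, min(qmax, round(w / scale))) for w in weights]
        err = weighted_error(weights, scale, q, importance)
        if best is None or err < best[0]:
            best = (err, scale, q)
    return best[1], best[2]


def weighted_error(weights, scale, q, importance):
    """Sum of importance_i * (w_i - scale * q_i)^2 over the block."""
    return sum(i * (w - scale * qi) ** 2 for w, qi, i in zip(weights, q, importance))


# Hypothetical block of weights and two hypothetical importance vectors,
# standing in for statistics gathered from two different calibration sets.
weights = [0.90, -0.31, 0.12, 0.05, -0.77, 0.44, 0.02, -0.15]
importance_a = [9.0, 0.1, 0.1, 0.1, 6.0, 0.2, 0.1, 0.1]  # calibration set A
importance_b = [0.1, 4.0, 7.0, 5.0, 0.1, 0.3, 6.0, 3.0]  # calibration set B

scale_a, q_a = quantize_block(weights, importance_a)
scale_b, q_b = quantize_block(weights, importance_b)

# Error as seen by data resembling set A, under each choice of scale.
err_a_with_a = weighted_error(weights, scale_a, q_a, importance_a)
err_a_with_b = weighted_error(weights, scale_b, q_b, importance_a)
print(f"scale tuned on A: {scale_a:.4f}, error on A: {err_a_with_a:.6f}")
print(f"scale tuned on B: {scale_b:.4f}, error on A: {err_a_with_b:.6f}")
```

In this toy model, the scale tuned on one set is never better on the other set than the scale tuned on that other set, because both are chosen from the same candidates. Whether the effect is large enough to matter for Chinese with these models is exactly what I have not measured.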

Model size: 14B params
Architecture: qwen