Instructions to use Qwen/Qwen3.5-27B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Qwen/Qwen3.5-27B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="Qwen/Qwen3.5-27B") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("Qwen/Qwen3.5-27B") model = AutoModelForImageTextToText.from_pretrained("Qwen/Qwen3.5-27B") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- HuggingChat
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use Qwen/Qwen3.5-27B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "Qwen/Qwen3.5-27B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.5-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/Qwen/Qwen3.5-27B
- SGLang
How to use Qwen/Qwen3.5-27B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "Qwen/Qwen3.5-27B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.5-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "Qwen/Qwen3.5-27B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.5-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use Qwen/Qwen3.5-27B with Docker Model Runner:
docker model run hf.co/Qwen/Qwen3.5-27B
Add MMMU-Pro evaluation result
#46 opened 10 days ago
by
SaylorTwift
qwen3.5-27b + openclaw
#45 opened about 1 month ago
by
RZZR
fix chat template to avoid empty historical `<think>` blocks
🔥 1
1
#44 opened about 2 months ago
by
latent-variable
Qwen 3.5 27b fine tuning multi-gpu example lora sft
#43 opened about 2 months ago
by
Ethermich
Heavy Chinese-English mixing
2
#42 opened about 2 months ago
by
ariesaa
65454
#41 opened 2 months ago
by
yehuashangui
what the best engine to run this model.
👀 1
1
#40 opened 2 months ago
by
Shimon324
qwen3.5-27b,快速模式下,发票的年份识别不确认,2026识别成2025,在思考模式下才正常
#39 opened 2 months ago
by
htl258
Add ScreenSpot-Pro evaluation result
#38 opened 2 months ago
by
merve
what's the matrics of livecode bench v6 when enable think off
#37 opened 2 months ago
by
HoldenShen
repeat_penalty not repetition
1
#36 opened 2 months ago
by
pants44
Add MathArena evaluation result for hmmt/hmmt_feb_2026
#35 opened 2 months ago
by
JasperDekoninck
Add MathArena evaluation result for aime/aime_2026
#34 opened 2 months ago
by
JasperDekoninck
`.cw` is output at the end of the reasoning content
8
#33 opened 2 months ago
by
owao
Chinese-English Typography Issues
#32 opened 3 months ago
by
douglarek
Looping forever
1
#31 opened 3 months ago
by
kil3r
[transformers-server] Continuous batching does not support Qwen-3.5 series
#30 opened 3 months ago
by
zhang-zhi-chun
Language Request: Garhwali
#29 opened 3 months ago
by
akshitgaur2005
What is the chat template for Ollama ?
👍 1
2
#28 opened 3 months ago
by
ngbruce
Fine tune with lora
1
#26 opened 3 months ago
by
saireddy
Please don't forget to release the base model of Qwen3.5-27B
👀➕ 11
2
#25 opened 3 months ago
by
Linksome
pure transformers inference code is needed
👀👍 4
3
#24 opened 3 months ago
by
maltoseflower
When?
👀 3
#23 opened 3 months ago
by
AxionLab-official
as a starter
1
#22 opened 3 months ago
by
kit17
Upload 3 files
#21 opened 3 months ago
by
shamamanaeem03
model overthinking
👍 3
1
#20 opened 3 months ago
by
cse2011
vLLM Version conflict issue
4
#19 opened 3 months ago
by
innosynth
有没有BFCLV4和tau2的细节指标
#18 opened 3 months ago
by
HuanchangLiu
Thinking Blocks Error
12
#16 opened 3 months ago
by
TronixAT
Plan on releasing the 4bits quantized version
👀👍 12
#15 opened 3 months ago
by
omarsou
Qwen are is the 8b coming out?
4
#14 opened 3 months ago
by
Crownelius
vllm serving issue
3
#13 opened 3 months ago
by
grozatech
Russian language support, bad grammar!
13
#12 opened 3 months ago
by
alexcardo
Add evaluation results
#11 opened 3 months ago
by
SaylorTwift
Availability of the Qwen3.5-27B Base model
👀👍 17
1
#9 opened 3 months ago
by
Jellyfish042
Value error, Model architectures ['Qwen3_5ForConditionalGeneration'] are not supported for now. Transformers version 5.3.0.dev0
👀 15
6
#8 opened 3 months ago
by
rameshch
Installation Video and Testing - Step by Step
🔥 3
#6 opened 3 months ago
by
fahdmirzac
amazing model
❤️ 7
2
#5 opened 3 months ago
by
Tugay31
is HLE w/ CoT a typo?
#4 opened 3 months ago
by
xiaoqianWX
Samplers
4
#3 opened 3 months ago
by
ggnoy
DENSE
🔥 5
2
#2 opened 3 months ago
by
ox-ox
请千万千万别忘了把 Qwen Image 2.0 也开源——这对我们本地用户来说会是个巨大的改变 :-)
🤯🧠 3
#1 opened 3 months ago
by
Hanswalter