---
license: apache-2.0
base_model:
- Qwen/Qwen3.6-27B
- Qwen/Qwen3.6-35B-A3B
- Qwen/Qwen3.5-27B
- Qwen/Qwen3.5-35B-A3B
pipeline_tag: text-generation
library_name: transformers
tags:
- chat-template
- jinja
- jinja2
- qwen3
- template-fix
- bugfix
---

09.06.2026

### This project is on hold now.

For the most stable experience at the moment, please use https://huggingface.co/spiritbuun/buun-Qwen3.6-chat_template

#### Preserve Thinking:

To enable preserve-thinking (the agent can remember what it thought in earlier turns), pass the following parameter when starting llama.cpp. This form works on Linux and macOS; Windows needs slightly different quoting, which cannot be displayed correctly here:

`--chat-template-kwargs '{"preserve_thinking": true}'`

#### OpenCode:

If you use OpenCode, note that it has an open bug that shows thinking content as plain text. The bug has been open for 4 months (https://github.com/anomalyco/opencode/issues/11439). In the meantime, you can use a llama.cpp server switch to hide the thinking completely (while still letting the model think) by adding this llama.cpp parameter:

`--reasoning-format deepseek`
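
Shell quoting of the JSON value is the part that tends to break across platforms. As a rough sketch, the two llama.cpp parameters described above could be assembled from Python as an argument list, which sidesteps shell quoting when the list is passed directly to `subprocess.run`. The helper name `build_server_args`, the `llama-server` binary name with its `-m` option, and the `model.gguf` path are illustrative assumptions, not part of this project.

```python
import json
import shlex


def build_server_args(model_path, preserve_thinking=True, hide_thinking=False):
    """Build an argument list for a llama.cpp server start.

    Hypothetical helper: the binary name and model path are assumptions.
    """
    args = ["llama-server", "-m", model_path]
    if preserve_thinking:
        # json.dumps produces valid JSON, so no hand-written quoting is needed.
        args += ["--chat-template-kwargs", json.dumps({"preserve_thinking": True})]
    if hide_thinking:
        # Per the text above: hides thinking output while the model still thinks.
        args += ["--reasoning-format", "deepseek"]
    return args


args = build_server_args("model.gguf", hide_thinking=True)

# POSIX-shell rendering of the same arguments, for copy and paste on Linux/macOS.
print(shlex.join(args))
```

Passing `args` as a list, for example `subprocess.run(args)`, hands the JSON string to the process unchanged, so the same code should not need platform-specific quoting. `shlex.join` is only used here to show the equivalent POSIX command line.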