Qwen 3.6 27B RYS XL (GGUF)

This is a modified version of the Qwen 3.6 27B model, utilizing the RYS (Repeat Your Self) technique.

What is RYS?

The RYS (Repeat Your Self) technique, discovered by David Ng, enhances the reasoning capabilities of Large Language Models by duplicating specific "reasoning" layers in the middle of the transformer stack. This increases the depth of the model's computation for semantic and logic-heavy tasks without requiring additional training.

Model Details

  • Architecture: Qwen 3.6 27B
  • RYS Configuration: Physical duplication of layers (26, 34).
  • Variant: XL (8 additional layers, bringing the total depth to 72 layers).
  • Format: GGUF (Quantized to Q8_0).
  • Tokenizer: Full Qwen 3.5/3.6 201-language support.

Performance

By repeating layers 26 through 34, the model spends more time processing the internal semantic representation of a prompt. This is particularly effective for:

  • Mathematical reasoning
  • Complex logic puzzles
  • Large-scale coding tasks

Usage

This GGUF model is compatible with any tool that uses llama.cpp, such as:

Prompt Format

This model uses the standard Qwen Chat template:

<|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant


CREDITS
Base Model: The Qwen Team at Alibaba Cloud.
RYS Technique: David Ng (dnhkng).
Quantization: Processed on a Mac Studio
Downloads last month
51
GGUF
Model size
30B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for LogicBombaklot/Qwen3.6-27B-RYS-XL-GGUF

Base model

Qwen/Qwen3.6-27B
Quantized
(6)
this model