Yi-1.5-6B-RYS-20-23

Yi-1.5-6B-Chat with layers 20-22 duplicated. A late-stack triple-coupled circuit β€” boosting math, EQ, and reasoning simultaneously β€” runs twice on every forward pass.

32 base layers β†’ 35 after duplication. No training, no merging, no weight changes.

Reasoning 76.47% β†’ 88.23% (+11.76). EQ 86.09 β†’ 91.87 (+5.78). Math 0.518 β†’ 0.5537 (+3.57). Combined Ξ” +21.11 β€” rare three-way positive lift in the v2 corpus.

Results

Metric Baseline RYS (20,23) Delta
Math 0.518 0.5537 +3.57
EQ 86.09 91.87 +5.78
Reasoning 76.47% 88.23% +11.76

The three-way lift. Most RYS sweeps surface a trade-off β€” math vs EQ, reasoning vs EQ, math vs reasoning. Yi-1.5-6B is the only model in the v2 corpus where the single best configuration delivers simultaneous positive lift across all three probes. The combined Ξ” +21.11 is reached without sacrificing any dimension.

Yi-1.5-6B was the closing sweep of the v2 cross-architecture queue (2026-05-12), bringing the corpus to N=21 and locking the final correlation at r = βˆ’0.726. The narrow boost signal (1 of 66 configs reasoning-boosters) means the triple-coupled config is specific to (20,23) block-3.

Pick this when you want simultaneous lift across all three dimensions without trade-off.

Usage

llama-server -m Yi-1.5-6B-RYS-20-23-Q4_K_M.gguf -ngl 99

Full sweep data

66 configurations tested. (20,23) block-3 is the unique three-way-positive pick. Full per-config sweep + cross-architecture analysis: v2 dataset.

Part of the RYS Sovereign Collection v2.


Where this sits in the Sovereign Collection

v1 β€” Qwen2.5 cross-scale + Qwen3-32B headline crossover. 5 model repos.

v2 β€” cross-architecture corpus. 21 model variants across 10 architecture families. Inverse correlation (r = βˆ’0.726): weak baselines lift more, in their weakest dimension. Yi-1.5-6B is on-curve at high baseline; the three-way coupling is what makes its row distinctive. 13 deployable RYS-applied weight repos covering every non-zero-lift variant.

Credit

John Broadway, with collaboration from Claude (Opus 4.6 in April 2026 sweep generation and build pipeline; Opus 4.7 in May 2026 cross-architecture analysis and publication). Original RYS method by David Ng on Qwen2-72B; sweep + probe toolkit by alainnothere.

Downloads last month
154
GGUF
Model size
7B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for john-broadway/Yi-1.5-6B-RYS-20-23-GGUF

Quantized
(30)
this model

Collection including john-broadway/Yi-1.5-6B-RYS-20-23-GGUF