---
title: My Pi Agent
emoji: 🦀
colorFrom: gray
colorTo: indigo
sdk: gradio
sdk_version: 6.14.0
python_version: '3.12'
app_file: app.py
pinned: false
models:
  - AlexWortega/qwen35-4b-soyuz-merged-gguf
---

# My Pi Agent — Soyuz

Chat with **qwen35-4b-soyuz-merged** (a Qwen3.5-4B hybrid linear-attention model)
served as a GGUF via `llama-cpp-python` on **ZeroGPU**.

The right-hand panel surfaces what the model actually produced:
- **🧠 Reasoning** — the `<think> ... </think>` chain of thought.
- **📝 Raw output** — the full untouched generation, tags included.
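
As a rough illustration of how the two panels could be derived from a single generation, the sketch below splits a raw string into its `<think>` reasoning and the remaining answer. The helper name `split_reasoning` is hypothetical and is not taken from this Space's `app.py`. It also assumes that an unclosed `<think>` tag (for example, when generation stops at the token limit) should count as reasoning.

```python
import re

# Matches one complete <think> ... </think> block, newlines included.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from a raw generation.

    Hypothetical helper for illustration only.
    """
    # Collect the contents of every closed <think> block.
    reasoning_parts = [part.strip() for part in _THINK_RE.findall(raw)]
    # Whatever is left after removing those blocks is the visible answer.
    answer = _THINK_RE.sub("", raw)
    # Assumption: an unclosed <think> means the rest of the text is reasoning.
    if "<think>" in answer:
        answer, _, tail = answer.partition("<think>")
        reasoning_parts.append(tail.strip())
    reasoning = "\n\n".join(part for part in reasoning_parts if part)
    return reasoning, answer.strip()


raw = "<think>2 + 2 is 4.</think>The answer is 4."
reasoning, answer = split_reasoning(raw)
print(reasoning)  # text for the Reasoning panel
print(answer)     # text for the chat reply
print(raw)        # the Raw output panel shows the string unchanged
```

Keeping `raw` untouched and deriving the other two views from it means the raw panel always reflects exactly what the model emitted, even if the tag parsing guesses wrong.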

GGUF: [`AlexWortega/qwen35-4b-soyuz-merged-gguf`](https://huggingface.co/AlexWortega/qwen35-4b-soyuz-merged-gguf),
quantized to `Q4_K_M`. The MTP head is dropped (`--no-mtp`) because llama.cpp does not yet run Qwen3.5 MTP.