Spaces:
Sleeping
Sleeping
File size: 789 Bytes
5e3dba1 e51ba55 5e3dba1 e51ba55 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 | ---
title: My Pi Agent
emoji: 🦀
colorFrom: gray
colorTo: indigo
sdk: gradio
sdk_version: 6.14.0
python_version: '3.12'
app_file: app.py
pinned: false
models:
- AlexWortega/qwen35-4b-soyuz-merged-gguf
---
# My Pi Agent — Soyuz
Chat with **qwen35-4b-soyuz-merged** (a Qwen3.5-4B hybrid linear-attention model)
served as a GGUF via `llama-cpp-python` on **ZeroGPU**.
The right-hand panel surfaces what the model actually produced:
- **🧠 Reasoning** — the `<think> ... </think>` chain of thought.
- **📝 Raw output** — the full untouched generation, tags included.
GGUF: [`AlexWortega/qwen35-4b-soyuz-merged-gguf`](https://huggingface.co/AlexWortega/qwen35-4b-soyuz-merged-gguf)
(`Q4_K_M`, MTP head dropped — `--no-mtp` — since llama.cpp does not yet run Qwen3.5 MTP).
|