Getting better speeds and somehow results then Qwen3.6-35B-A3B! Q4

#15
by SaturnsVoid - opened

On my 9070XT (16GB) and 48GB Ram on LM Studio with 64000 context Q4 getting better OpenCode results and faster then Qwen3.6-35B-A3B (20-25/ts ---> 30-33/ts). Now just waiting for someone to uncensor it and i will probably replace Qwen3.6-35B-A3B with it as my daily runner!

You should test this on benches, cause appearance could fail you. It worked so far, but you're just guessing at this point.

Sign up or log in to comment