It does not work with Hermes

#8
by hzeid - opened

Model loaded correctly, but it can not use the tools, it does not complete any task, it can generate simple text , but no tools

  • low quantization
  • tool support not declared somewhere
  • missing jinja template for correct tool handling

so does it or does it not?

it's working for me -
exec llama-server
-hf HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive
--host 0.0.0.0
--port 8090
--ctx-size 251072
--n-gpu-layers 99
--flash-attn on
--cache-type-k q4_0
--cache-type-v q4_0
--parallel 1
--cont-batching
--jinja
--alias "qwen-uncensored"
--api-key 1234567890
--no-webui
-ub 512
--no-mmap

Sign up or log in to comment