Seems like chat template is broken?

#3
by tarruda - opened

Running Q6_K_L on llama server and asking any question results in this error being popped up:
image

I tried passing reasoning_effort via --chat-template-kwargs but didn't seem to work

completely broken

hmm that's a strange one, probably something we should look into in llama.cpp itself, assuming you're not somehow passing low or any other value then this shouldn't be failing like that

Can you try again out of curiousity?

I tried launching the server with default args and sent the example prompt:

curl --request POST \
    --url http://localhost:8080/completion \
    --header "Content-Type: application/json" \
    --data '{"prompt": "Building a website can be done in 10 simple steps:","n_predict": 128}'

and it had no errors, not sure if the tool you're using is passing something bad or if there was an update

I also got the same thing with unsloth Q6_K. No idea what it could be, maybe a bug in the web UI

Sign up or log in to comment