add Q3 please

#1
by Y4iges - opened

add Q3 please, for users with 16 vram.

Y4iges changed discussion status to closed

Sure, will do!

Owner
β€’
edited May 7

Added Q2 and Q3, though depending on your settings they can sometimes output gibberish. Not sure if it's a bug in the MTP implementation or in llama.cpp itself, since the base model without MTP also showed the same issue occasionally.

Sign up or log in to comment