MTP layer

#2
by engrtipusultan - opened

It does not have mtp layer can you upload without MTP stripped?

Also when you plan to upload coding optimized version?

General-Instinct org

I tried compiling it with MTP and DFlash, but it hurt the output quality quite a bit, so I decided not to release it with the MTP layer.

We're still working on decode speed optimization. Hopefully, we'll have another version focused on faster decoding performance available soon.

I did not quite get that. Keeping the MTP layer does not mean it is enabled and Dflash is not supported in mailline llama.cpp hence in lmstudio etc.

Hopefully you can add some benchmarks when you upload final version against the original model for accuracy and speed.

Sign up or log in to comment