will there be a MTP version?

#4
by jasoncow - opened

to speed up with mtp

yes, totally. I'm following up llama.cpp upstream closely. Once that's merged I'll create an MTP version right away!

You can use the script I created to experience the MTP effect firsthand :)
I have tested it, and it can run on the APEX model.

https://www.modelscope.cn/models/HereIsMark/Qwen3.6-35B-A3B-MTP-Donor

yes, totally. I'm following up llama.cpp upstream closely. Once that's merged I'll create an MTP version right away!

Tons of MTP-refined models are popping up now. We’ve already pulled the branch locally and it’s stable. Reddit is buzzing with discussions. I’d recommend getting started with the new models right away—things are happening way too fast.

Really excited for the MTP version, @mudler ! Great to hear you're tracking the llama.cpp upstream so closely — that dedication shows in the quality of your work. This model is already fantastic, and with MTP support it's going to be even better. Keep it up, can't wait!

是的 非常期待您这款优秀模型的MTP版本, 还有谷歌模型的草稿版本

@mudler

yes, totally. I'm following up llama.cpp upstream closely. Once that's merged I'll create an MTP version right away!

MTP has just been merged.

Sign up or log in to comment