Which Branch?

#1
by RedDragonGecko - opened

Hello. Again.
Saw you recommended your fattn branch over the main llama.cpp for your non-Pro quants.
Was curious whether that holds true for the Pro ones too.
Didn't want to derail the thread.
What about the vision branch? I saw they were merged but not which to which.

Hi, the vision branch is forked from the attention branch, so it has all but the latest tweaks I've made in the PR. Basically it should still run fine, just with a few extra kernels that aren't strictly required.

You can run Pro on either the fattn or vision branch as well. Pro doesn't have any multimodal components, though, so the main benefit of staying on the vision branch is that it saves you a recompile.
