How to enable vision encoder?

#10
by stefan28123 - opened

Hi everyone,
Thanks for this great model.
We are currently using llama.cpp to run the MXFP4_MOE on our RTX 5000 48GB.
However we noticed that it only loads up text modality. Is there any guide on how to enable visio capabilities (e.g. loading the vision encoder?)
Thanks!

if you're using llama.cpp try --mmproj path/to/mmproj-BF16.gguf in your server startup cmdline

Sign up or log in to comment