humble request if possible to do this neo code magic to latest NVIDIA-Nemotron-3-Nano-Omni-30B-A3B

#3
by Hakemz - opened

First thank you for amazing amazing Qwen3.6-27B-NEO-CODE-Di-IMatrix-MAX-GGUF, finally after long time waiting now it is possible for me to have local creative and good vision model for my mere 12 GB VRAM even if it just IQ2_M but magically still have high accuracy and much more creative than base model. It is slow though because I use image input too with text but usable enough and not too slow that is annoying. However when NVIDIA-Nemotron-3-Nano-Omni-30B-A3B came out and I tried it, surprisingly it is really good in its vision capability far better than even this neo code quant and base qwen3.6 and fast too since 3B activated only, I mean really really good, but when it comes to creativity it is less creative than this qwen3.6 27b neo code quant. So I think if you do this neo code magic to this new model, probably it will be the best small all purpose model ever that can be run locally! With 3B activated, my workflow time can be reduced to hour or two instead of 4 hours+ with Qwen3.6-27B-NEO-CODE-Di-IMatrix-MAX-GGUF.

So just my humble request but can be ignored though. :D

Hakemz changed discussion status to closed
Owner

You may want to check this out ; uploading now:

https://huggingface.co/DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF

This model AND quants exceed Qwen 3.6 27B performance and fully uncensored too.

Sign up or log in to comment