Since I like having more than one local LLM to switch between when analysing tricky development issues I decided to try out this new MoE model today. It’s a 30B A3B which means it’s basically a drop-in replacement for Qwen 3.6 35B A3B with suitable llama.cpp parameters the same.

On their own published benchmark metrics it’s supposed to be slightly worse than Qwen, but so far it’s not something I’ve noticed. It’s tuned to work well in Opencode which is how I’m running it as well.

Try it out, see how it works for you. I know that there are those who would rather use a Canadian than Chinese model in today’s political climate and it does seem to perform better than Gemma 4 at least for me. Just don’t forget to use the PR linked from unsloth’s description until it has been merged into main.