North Mini Code v1.0 - a Qwen 3.6 35B MoE alternative

troed@fedia.io · 11 days ago

North Mini Code v1.0 - a Qwen 3.6 35B MoE alternative

e0qdk@reddthat.com · 11 days ago

Interesting. Looks like I’d need to build a special llama.cpp to get it to run on my system currently, and I think I could get lost for a long time if I start digging up that rabbit hole… so maybe not today, but I’ll keep an eye out and give it a try if support lands in main.

Is it doing any better than Qwen at avoiding getting stuck in thinking loops?

melfie@lemmy.zip · 10 days ago

This Dockerfile worked for me to build the llama-cpp-turboquant fork: https://huggingface.co/spaces/ai-engineering-at/llama-cpp-turboquant-guide/blob/main/Dockerfile. Should work for upstream too. The Dockerfile I made myself crashed 2 different machines, but then I found this one and can confirm it works well.

e0qdk@reddthat.com · 10 days ago

I’ve got an AMD system so that probably won’t work for me, but glad it’s working for you and maybe it will help others!

How does the model compare to Qwen and Gemma4 so far?

melfie@lemmy.zip · 10 days ago

Ah, ok, hope it helps someone. I’ll probably try the model this weekend sometime.

North Mini Code v1.0 - a Qwen 3.6 35B MoE alternative

North Mini Code v1.0 - a Qwen 3.6 35B MoE alternative

unsloth/North-Mini-Code-1.0-GGUF · Hugging Face