North Mini Code v1.0 - a Qwen 3.6 35B MoE alternative

troed@fedia.io · 12 days ago

melfie@lemmy.zip · 11 days ago

This Dockerfile worked for me to build the llama-cpp-turboquant fork: https://huggingface.co/spaces/ai-engineering-at/llama-cpp-turboquant-guide/blob/main/Dockerfile. Should work for upstream too. The Dockerfile I made myself crashed 2 different machines, but then I found this one and can confirm it works well.

e0qdk@reddthat.com · 11 days ago

I’ve got an AMD system so that probably won’t work for me, but glad it’s working for you and maybe it will help others!

How does the model compare to Qwen and Gemma4 so far?

melfie@lemmy.zip · 11 days ago

Ah, ok, hope it helps someone. I’ll probably try the model this weekend sometime.