The Autodidacts

How to fit Qwen 3.6 35B A3B into 16GB of VRAM, & run it with Llama.cpp on an RTX 3080 (opens in new tab)

Covers Can your machine run AI models?

The belly hangs over the belt, but it fits

Read the original article

Sign in to keep reading the full article.