Lingdas1/local-llm-guide: The definitive guide to running LLMs locally — Ollama, GGUF, Open WebUI, DeepSeek V4, Qwen 3.6, GLM-4.7, hardware matching, and production deployment. (opens in new tab)  🦙Ollama  Content type: Code

GGUF & Modelfile: The Power User's Guide to Local LLMs Beyond ollama pull — download any model from Hugging Face, quantize it, customize it, and import it into Ollama. What's GGUF? GGUF (GPT-Generated Unified Format) is the standard file format for running LLMs locally. Think of it as the .mp3 of AI models: Compressed — 70-85% smaller than the original float16 weights Fast — optimized for CPU and GPU inference Portable — one file contains the entire model Metadata-rich — includes tokenizer, c...

Read the original article
Sign in to keep reading the full article.

Cited by 4 articles

dev.to··DEV
dev.to··DEV
dev.to··DEV
View all 4 ›

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help