Ollama

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

 💻Wezterm
everylocalai.com··DEV

Unsloth Minimax M3 GGUF

 💻Wezterm

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

 💻Wezterm
alternativeto.net·

DevMando/MandoCode: A .NET C# CLI Coding Agent powered by Ollama + Semantic Kernel and RazorConsole. Run locally or in the cloud. Refactors code, proposes diffs, and updates your project safely — no API keys required.

 🔧MCP  Content type: Code
github.com··Hacker News

Ollama's highest performance on Apple Silicon yet with MLX

 🧠LLMs  Content type: Blog
ollama.com·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🧠LLMs

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

 💻AI

How to Run an LLM Locally: Ultimate Guide to Local AI 2026

 🧠LLMs  Content type: Blog
cswithsanjay.blogspot.com·

From Chatbot Hallucinations to Deterministic Agents: Forcing Local LLMs to Run Production-Grade…

 🧠LLMs  Content type: Blog
medium.com·

local llm on laptop 780M GPU using llama + gemma 4 qat

 🧠LLMs  Content type: Blog
alper.bearblog.dev·

What Ollama Reveals About Local AI, Agents, and Open Models

 🤖AI software development  Content type: Blog
odsc.medium.com·

I gave a local LLM access to my Docker containers, and it replaced my monitoring scripts

 💻Wezterm
xda-developers.com·

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

 🤖AI Agents  Content type: News  Content type: Tutorial
zdnet.com·

lightmetal: GPU LLM Inference From a Single Java 25 JAR

 💻Wezterm  Content type: Blog
adambien.blog·

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

 💻Wezterm  Content type: Blog

Ask HN: What's the best LLM model that on a 24 GB VRAM GPU?

 🧠LLMs  Content type: Discussion

fix: resolve managed secretref provider auth (#92235) · openclaw/openclaw@9386d62

 🔌LSP  Content type: Code  Content type: Release
github.com·

Self-hosted remote access for Ollama without complicated setup

 🔌LSP

Less-relevant results

DiffusionGemma: 4x Faster Text Generation

 🧠LLMs  Content type: News  Content type: Blog  19 sources covering this post

Fixing a stuck Ollama runner and building a GPU watchdog

 💻Wezterm
