Local LLMs

Feeds to Scour
SubscribedAll
Scoured 440 posts in 7.4 ms

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

馃煝NVIDIA
everylocalai.comDEV

Improved performance and model support with GGUF

馃Open Source AIContent type: Blog
ollama.com

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

馃Open Source AI
alternativeto.net

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

馃Open Source AIContent type: Code
github.comDEV

Qwen 3.6 27B AutoRound GGUF, need your feedback

馃LLMs

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

馃Open Source AI

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

馃Open Source AIContent type: Blog
adambien.blog

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

馃Open Source AIContent type: Academic
arxiv.org

What Ollama Reveals About Local AI, Agents, and Open Models

馃Open Source AIContent type: Blog
odsc.medium.com

Using Scikit-LLM with Open-Source LLMs

馃Open Source AI

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

馃Open Source AIContent type: NewsContent type: Tutorial
zdnet.com

I added this open-source tool to my local AI stack, and my local LLM finally has persistent memory

鉁嶏笍Prompt Engineering
xda-developers.com

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

馃LLMs

Fixing a stuck Ollama runner and building a GPU watchdog

馃Open Source AI

local llm on laptop 780M GPU using llama + gemma 4 qat

馃Open Source AIContent type: Blog
alper.bearblog.dev

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

馃煝NVIDIAContent type: NewsContent type: Blog
developer.nvidia.com

Unsloth Gemma 4 QAT

馃Open Source AI
unsloth.ai

Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM

馃Open Source AI
everylocalai.comDEV

Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings

馃Open Source AI
posts.inthecyber.com

fix(opencode-go): add qwen plus tiered pricing (#91351)

馃捇Code GenerationContent type: Code
github.com

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help