🏠 Local LLMs - kudolink

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

🤗Open Source AI Code

github.com··DEV

Qwen 3.6 27B AutoRound GGUF, need your feedback

🧠LLMs

huggingface.co··r/LocalLLaMA

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

🤗Open Source AI

deemwar-products.github.io··Hacker News

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🤗Open Source AI Blog

adambien.blog·

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

🤗Open Source AI Academic

arxiv.org·

What Ollama Reveals About Local AI, Agents, and Open Models

🤗Open Source AI Blog

odsc.medium.com·

Using Scikit-LLM with Open-Source LLMs

🤗Open Source AI

machinelearningmastery.com·

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

🤗Open Source AI News Tutorial

zdnet.com·

I added this open-source tool to my local AI stack, and my local LLM finally has persistent memory

✍️Prompt Engineering

xda-developers.com·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

🧠LLMs

vettedconsumer.com··Hacker News

Fixing a stuck Ollama runner and building a GPU watchdog

🤗Open Source AI

patrickmccanna.net··Hacker News

local llm on laptop 780M GPU using llama + gemma 4 qat

🤗Open Source AI Blog

alper.bearblog.dev·

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

🟢NVIDIA News Blog

developer.nvidia.com·

Unsloth Gemma 4 QAT

🤗Open Source AI

unsloth.ai·

Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM

🤗Open Source AI

everylocalai.com··DEV

Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings

🤗Open Source AI

posts.inthecyber.com·

fix(opencode-go): add qwen plus tiered pricing (#91351)

💻Code Generation Code

github.com

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

Improved performance and model support with GGUF

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

Qwen 3.6 27B AutoRound GGUF, need your feedback

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

What Ollama Reveals About Local AI, Agents, and Open Models

Using Scikit-LLM with Open-Source LLMs

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

I added this open-source tool to my local AI stack, and my local LLM finally has persistent memory

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

Fixing a stuck Ollama runner and building a GPU watchdog

local llm on laptop 780M GPU using llama + gemma 4 qat

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

Unsloth Gemma 4 QAT

Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM

Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings

fix(opencode-go): add qwen plus tiered pricing (#91351)