🤖 AI - tionis · Scour

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

deemwar-products.github.io · Hacker News

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

everylocalai.com · DEV

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

🧱data structures Blog

bric.pe.kr · DEV

fix(ollama): use provider thinking default in SDK session factory (#9… · openclaw/openclaw@4f3c2cd

🧱data structures Code

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🐧unix Blog

adambien.blog

A Complete Beginner's Guide to Local LLM Inference

🧩lisp Blog

khnsakhnm.medium.com

local llm on laptop 780M GPU using llama + gemma 4 qat

🐧unix Blog

alper.bearblog.dev

What Ollama Reveals About Local AI, Agents, and Open Models

🕸️graphs Blog

odsc.medium.com

I added this open-source tool to my local AI stack, and my local LLM finally has persistent memory

xda-developers.com

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

🧱data structures Discussion

news.ycombinator.com · Hacker News

Improved performance and model support with GGUF

🕸️graphs Blog

Qwen 3.6 27B AutoRound GGUF, need your feedback

🧱data structures

huggingface.co · r/LocalLLaMA

Holding the FP8 Quality Ceiling at 8-Bit Weights and Activations: INT8 and GGUF Post-Training Quantization of Ideogram 4.0 for Consumer GPUs

🧱data structures Academic

Token4Token — pay-per-token inference on Gnosis + Swarm

🧱data structures

t4t.eth.link · Hacker News

Unsloth Gemma 4 QAT

🧱data structures

On-device AI is a margin decision

🧱data structures Blog

ziraph.com · Hacker News

Fixing a stuck Ollama runner and building a GPU watchdog

patrickmccanna.net · Hacker News

I Built an AI That Asks Whether Your Spending Creates Joy, Not Just Whether You Are Over Budget

🧩lisp Blog

joyboseroy.medium.com

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

🧱data structures Blog

towardsai.net

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

🧱data structures News Tutorial
