🌐 Open Source AI - liux0629 · Scour

Improved performance and model support with GGUF

💬LLMs Blog

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

everylocalai.com · DEV

techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.

✍️Prompt Engineering Code

Orchestrate your LLM pipeline. Locally

✍️Prompt Engineering

llmforge.app · Hacker News

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

✍️Prompt Engineering Academic

What Ollama Reveals About Local AI, Agents, and Open Models

⚙️MLOps Blog

odsc.medium.com

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

✍️Prompt Engineering

deemwar-products.github.io · Hacker News

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

💬LLMs Blog

bric.pe.kr · DEV

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

✍️Prompt Engineering

har-ki.github.io · Hacker News

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

✍️Prompt Engineering Discussion

news.ycombinator.com · Hacker News

local llm on laptop 780M GPU using llama + gemma 4 qat

✍️Prompt Engineering Blog

alper.bearblog.dev

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

🤖AI News

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

✍️Prompt Engineering

xda-developers.com

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

🧠AI Agents News Tutorial

You don't need Copilot for code completion, try this instead

mistral.ai · r/GithubCopilot

DiffusionGemma 26B A4B results on my 5090

✍️Prompt Engineering

huggingface.co · r/LocalLLaMA

Google launches DiffusionGemma text diffusion model: 4x faster local AI inference

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

alternativeto.net

Show HN: In-browser real LLM token counter and cost estimation

✍️Prompt Engineering

holaclaw.ai · Hacker News

lightmetal: GPU LLM Inference From a Single Java 25 JAR

✍️Prompt Engineering Blog

adambien.blog
