🤖 LLMs - tompeart · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🤖AI Code

github.com··Hacker News, r/LLM

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

🤖AI Academic

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

🤖AI Academic

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

har-ki.github.io··Hacker News

Intelligent inference scheduling with llm-d on Red Hat AI

developers.redhat.com·

Why LLMs (still) lack taste

beyondtheprior.com··Hacker News

Claude vs GPT-4: Which AI API Is Better for Developers? (2026)

kalyna.pro··DEV

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

everylocalai.com··DEV

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

🏃Running News

spectrum.ieee.org··Hacker News

Why Your LLM Gets Dumber With More Context

siliconopera.com·

What Ollama Reveals About Local AI, Agents, and Open Models

🤖AI Blog

odsc.medium.com·

The smartest ChatGPT users are putting local AI in front of it — here's why

Fixing a stuck Ollama runner and building a GPU watchdog

⚙System programming

patrickmccanna.net··Hacker News

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

uccl-project.github.io··Hacker News

Built and launched a research-reading and highlighting tool with Claude over a few months. Here are the things AI was surprisingly good (and bad) at.

highlyt.app··r/ClaudeAI

Improved performance and model support with GGUF

🤖AI Blog

MCP Architecture Explained for Beginners: Why AI Needs a Structured Communication System

🤖AI Blog

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

🇬🇧London Tech Blog

bric.pe.kr··DEV

Large companies can add a local LLM filter layer to considerably reduce their AI costs

umrashrf.github.io··Hacker News

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

phoronix.com··r/artificial
