💬 LLMs - simiasherextra · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🔌Embedded Systems Code

github.com··Hacker News, r/LLM

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

✨Neural Radiance Fields

everylocalai.com··DEV

Should LLM Agents Decide in Social Simulations? Comparing Finite-State and LLM-Based Decision Policies

🛡️AI Safety Academic

Introducing LLM as a Judge: Scaling search relevance evaluation with AI

👁️Computer Vision Blog

opensearch.org·

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

🌐AGI Academic

How LLMs are Actually Trained

🛡️AI Safety News Blog

blog.algomaster.io·

local llm on laptop 780M GPU using llama + gemma 4 qat

🔌Embedded Systems Blog

alper.bearblog.dev·

Orchestrate your LLM pipeline. Locally

🧠AI Research

llmforge.app··Hacker News

What Ollama Reveals About Local AI, Agents, and Open Models

🛡️AI Safety Blog

odsc.medium.com·

How ChatGPT Actually Works (Beginner Friendly)

🏳️‍🌈LGBT Tech Blog

·

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

🔓Open Source

xda-developers.com·

Intelligent inference scheduling with llm-d on Red Hat AI

🔌Embedded Systems

developers.redhat.com·

WWDC 2026: Foundation Models (& Anarlog)

🏳️‍🌈LGBT Tech

skushagra.com·

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

🔌Embedded Systems

har-ki.github.io··Hacker News

lightmetal: GPU LLM Inference From a Single Java 25 JAR

🔌Embedded Systems Blog

adambien.blog·

How we fight GPU scarcity without compromise

🔌Embedded Systems Blog

equixly.com··Hacker News

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

✨Neural Radiance Fields Discussion

news.ycombinator.com··Hacker News

Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst

🌐AGI Audio

Why Your LLM Gets Dumber With More Context

🛡️AI Safety

siliconopera.com·

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

🔌Embedded Systems Blog

bric.pe.kr··DEV

Log in to enable infinite scrolling