🧠 LLMs - Lucasg2g · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🤖AI Agents Code

github.com · Hacker News

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

✨vibe coding Academic

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

zozo123.github.io · Hacker News

A Plea to the Labs: Let the Models Diagnose.

🐛Bug Bounty Blog

tangent.bearblog.dev · Hacker News

Why LLMs (still) lack taste

beyondtheprior.com · Hacker News

Using Scikit-LLM with Open-Source LLMs

machinelearningmastery.com

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

🌐Open Source

The Rise of Agentic AI: What Every Engineer Should Learn

🤖AI Agents Blog

Using local LLMs for agentic coding

🤖AI Agents Blog

blog.alexewerlof.com

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🐛Bug Bounty Blog

adambien.blog

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

✨vibe coding News Blog

developer.nvidia.com

How we fight GPU scarcity without compromise

🤖AI Agents Blog

equixly.com · Hacker News

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

🤖AI Agents Blog

blogs.nvidia.com

What are AI parameters — and why does everyone keep talking about billions of them?

🤖AI Agents Blog

Slack bot for the whole team, not per-seat

🔑Authentication Discussion

plugand.ai · Hacker News

How LLMs work | Practical Leaders

practical-leaders.com · Hacker News

Report: GKE Inference Gateway delivers up to 92% faster AI responses

🤖AI Agents Blog

cloud.google.com · Hacker News

local llm on laptop 780M GPU using llama + gemma 4 qat

✨vibe coding Blog

alper.bearblog.dev

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

🌐Open Source

smolhub.com · r/LocalLLaMA

Less-relevant results

The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has

xda-developers.com
