🧠 LLMs - marlonp · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🤖AI Code

github.com··Hacker News, r/LLM

Making a Vintage LLM from Scratch

🔓Open Source AI

crlf.link··Hacker News

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

🔓Open Source AI Academic

Fine-tuning Large Language Models (LLMs) using PEFT

🤖AI Blog

·

Why LLMs (still) lack taste

beyondtheprior.com··Hacker News

New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"

🕵️AI Agents Discussion

news.ycombinator.com··Hacker News

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

🔓Open Source AI

vettedconsumer.com··Hacker News

Intelligent inference scheduling with llm-d on Red Hat AI

🔓Open Source AI

developers.redhat.com·

LLM Routing: From Strategy Selection to Production Architecture

⚙️MLOps Blog

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

🔓Open Source AI

uccl-project.github.io··Hacker News

lightmetal: GPU LLM Inference From a Single Java 25 JAR

🔓Open Source AI Blog

adambien.blog·

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

🔓Open Source AI News Blog

blog.google··Hacker News

Researchers say they trained a foundation model from scratch for about $1,500

🔓Open Source AI

venturebeat.com··Hacker News

Qwen 3.6 27B AutoRound GGUF, need your feedback

🔓Open Source AI

huggingface.co··r/LocalLLaMA

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

🔓Open Source AI

aermia.com··Hacker News

Predictive Processing: Conscious when Training

lesswrong.com·

Treating LLMs as Programming Books

🔓Open Source AI Blog

jola.dev··Hacker News

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

🔓Open Source AI Blog

blogs.nvidia.com·

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

🔓Open Source AI News Blog

developer.nvidia.com·

local llm on laptop 780M GPU using llama + gemma 4 qat

🔓Open Source AI Blog

alper.bearblog.dev·

Log in to enable infinite scrolling