🧠 LLMs - mmm18ix · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🤖AI Code

github.com··Hacker News

You don't need Copilot for code completion, try this instead

mistral.ai··r/GithubCopilot

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

🤖AI News

newsletter.semianalysis.com··Hacker News

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

☕Java Blog

adambien.blog·

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

🤖AI Academic

Using Scikit-LLM with Open-Source LLMs

machinelearningmastery.com·

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

✍️Prompt Engineering Academic

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

aermia.com··Hacker News

Report: GKE Inference Gateway delivers up to 92% faster AI responses

🤖AI Blog

cloud.google.com··Hacker News

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

🤖AI Blog

dnhkng.github.io·

AI Paper Review: Training Language Models to Follow Instructions with Human Feedback (InstructGPT)

freecodecamp.org·

Build a Medical Report Analyzer on Dedicated Inference with Python

digitalocean.com·

Running LLM Inference on Kubernetes: What It Actually Takes

☁️Cloud Native Blog

fairwinds.com·

LLM Inference Engineering Room — Part 3: The Orchestration Layer

🤖AI Blog

vimal-dwarampudi.medium.com·

local llm on laptop 780M GPU using llama + gemma 4 qat

🤖AI Blog

alper.bearblog.dev·

lightmetal: GPU LLM Inference From a Single Java 25 JAR

🤖AI Blog

adambien.blog·

LLM-as-a-Discriminator: When Synthetic Tables Still Look Real

🤖AI Academic

How attackers are gaining access to LLM inference

🤖AI Blog

What Are Tokens in LLMs?

🤖AI Blog

bearisland.dev··Hacker News

BacteReason: A Reasoning Model for Antimicrobial Resistance Prediction

🤖AI Academic
