LLMs

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 💡AI  Content type: Code
github.com··Hacker News

Introducing the Third Generation of Apple’s Foundation Models

 💡AI

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

 💡AI  Content type: Academic
arxiv.org·

Nvidia Ships the Foundation Model Physical AI Has Been Waiting For

 🛠️AI Tooling
pymnts.com·

LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents

 ✍️Prompt Engineering  Content type: Blog
towardsai.net·

How LLMs work | Practical Leaders

 🛠️AI Tooling

LLM Routing: From Strategy Selection to Production Architecture

 ⚙️Workflow Automation  Content type: Blog
blog.n8n.io·

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

 🛠️AI Tooling

Why LLMs (still) lack taste

 💡AI

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

 💡AI  Content type: Blog
blogs.nvidia.com·

Apple's Foundation Models can now use third-party LLMs (Claude, Gemini) [video]

 🔶Claude

Using Scikit-LLM with Open-Source LLMs

 🔓Open Source LLMs

LLMs Are Brilliant. But They Can Be Fooled.

 🔓Open Source LLMs  Content type: Blog
medium.com·

RAG Pipeline Explained: From Query to Answer, Step by Step

 📚RAG  Content type: Blog
medium.com·

LLM Inference Engineering Room — Part 3: The Orchestration Layer

 🛠️AI Tooling  Content type: Blog

Modernizing attendance ticketing in SAS Viya using SAS Agentic AI Accelerator

 🤖AI Agents  Content type: Blog
blogs.sas.com·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🔓Open Source LLMs

WWDC 2026: Foundation Models (& Anarlog)

 🔓Open Source LLMs
skushagra.com·

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

 🐋DeepSeek  Content type: Academic
biorxiv.org·

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 🔓Open Source LLMs  Content type: News  Content type: Blog
blog.google··Hacker News
