LLMs

Feeds to Scour
SubscribedAll
Scoured 472 posts in 9.2 ms

lightmetal: GPU LLM Inference From a Single Java 25 JAR

 🤖AI Agents  Content type: Blog
adambien.blog·

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

 🔭Competitive Intel

Claude Fable 5 is Mythos for the masses

 📡Information Diet  Content type: Blog
techzine.eu·

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

 ✍️Prompt Engineering  Content type: Blog
medium.com·

Making a Vintage LLM from Scratch

 🤖AI Agents
crlf.link··Hacker News

Report: GKE Inference Gateway delivers up to 92% faster AI responses

 ⚙️AI Workflows  Content type: Blog

Deep Learning Weekly: Issue 458

 🤖AI Agents

Foundation Models: Apple Isn’t Building an AI Model. It’s Building an AI Platform.

 ⚙️AI Workflows  Content type: Blog
medium.com·

A Plea to the Labs: Let the Models Diagnose.

 🤖AI Agents  Content type: Blog

New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"

 🤖AI Agents  Content type: Discussion

Transitioning from Azure Language Features to Foundry Models

 ⚙️AI Workflows

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

 ✍️Prompt Engineering

You don't need Copilot for code completion, try this instead

 🤖AI Agents

Should LLM Agents Decide in Social Simulations? Comparing Finite-State and LLM-Based Decision Policies

 🤖AI Agents  Content type: Academic
arxiv.org·

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🤖AI Agents  Content type: Blog
adambien.blog·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 ⚙️AI Workflows  Content type: Code
github.com··Hacker News, r/LLM

AI 101: From Prompt Engineering to Skill Engineering

 ✍️Prompt Engineering
turingpost.com·

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

 🔭Competitive Intel  Content type: Blog
bric.pe.kr··DEV

Researchers say they trained a foundation model from scratch for about $1,500

 💼AI Business
venturebeat.com··Hacker News

What Is Generative AI?

 ✍️Prompt Engineering  Content type: Academic
excelsior.edu·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help