AI Engineering

Feeds to Scour
SubscribedAll
Scoured 638 posts in 10.4 ms

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 📡Observability  Content type: News  Content type: Blog
blog.google··Hacker News

Prompt Caching Explained: The AI Concept That Can Save Millions of Tokens

 Developer Productivity  Content type: Blog
sweta-nit.medium.com·

Audio-first, deep-dive RAG Masterclass on YT it's called "Master RAG while you sleep"

 Developer Productivity  Content type: Video

Agentic AI frameworks compared: LangChain, LangGraph, AutoGen

 🔭observability engineering  Content type: Blog
udacity.com·

Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon

 Developer Productivity
xda-developers.com·

How to Build an Agentic RAG with RubyLLM and Rails

 🔭observability engineering  Content type: Blog
panasiti.me··Hacker News

An AI-Powered Trisomy 21 Research Assistant

 🔭observability engineering  Content type: Academic
biorxiv.org·

The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure

 📡Observability
devops.com·

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 💾Semiconductors

AI Serving Platform That Adapts to Your Model

 🔭observability engineering  Content type: Blog
databricks.com·

New comment by Revanthkodati in "Ask HN: Who wants to be hired? (June 2026)"

 👀Code Review

Modernizing attendance ticketing in SAS Viya using SAS Agentic AI Accelerator

 📡Observability  Content type: Blog
blogs.sas.com·

Pruned YOLOv8 ONNX INT8 Fails: 3 Fixes That Work

 🔭observability engineering  Content type: Blog  Content type: Discussion
tildalice.io·

A Fun & Absurd Introduction to Vector Databases • Alexander Chatzizacharias

 🔭observability engineering  Content type: Video
youtu.be··r/programming

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

 Developer Productivity
androidauthority.com·

Cloudian closes gap between enterprise AI ambitions and messy production deployments

 📡Observability  Content type: News
blocksandfiles.com·

Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite

 📡Observability  Content type: Academic
arxiv.org·

Big Blue’s Redbook on Storage Scale KV Cache management

 📡Observability  Content type: News
blocksandfiles.com·

New comment by monishes in "Ask HN: Who wants to be hired? (June 2026)"

 👀Code Review  Content type: Discussion

HNSW vs LSH: How Elasticsearch hits 0.99 recall@10 at 15,000 QPS — and what it costs

 📡Observability  Content type: Blog
elastic.co·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help