🤖 AI Engineering - celurian92 · Scour

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

🧠Machine Learning News

newsletter.semianalysis.com··Hacker News

If Claude Fable stops helping you, you'll never know

🧠Machine Learning Blog

jonready.com··Lobsters, Hacker News

ICYMI: Inside the Microsoft Agent Framework: How we designed a layered SDK

🔍RAG Blog

devblogs.microsoft.com·

I built a free extension that adds shared folders + search across ChatGPT, Claude and Gemini

foldery.app··r/chrome_extensions

Running LLM Inference on Kubernetes: What It Actually Takes

📊Observability Blog

fairwinds.com·

What I learned building an AI chatbot for websites and docs

✍️Prompt Engineering

chattybox.ai··DEV, r/SideProject

The hidden bottleneck in LLM inference and the impact on MLPerf benchmarking

TA-RAG: Tone-Aware Retrieval-Augmented Generation for Peer-Support Health Communication

🔍RAG Academic

Best practices for building a modern app with vector search

🔍RAG Blog

Hybrid Search for RAG: Fix Retrieval Accuracy in AI

🔍RAG Blog

PagedAttention vs Traditional KV Cache: How vLLM Reinvented GPU Memory for LLM Inference

🧠LLMs Blog

The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure

🏗️Backend Architecture

The AI Curse (Vis the Lisp Curse)

🔍RAG Blog

blog.djhaskin.com··Hacker News

How to Defend Against Prompt Injection in Production

✍️Prompt Engineering Reference

leanpub.com··DEV

New comment by bedelloperator in "Ask HN: Who wants to be hired? (June 2026)"

🔍RAG Discussion

news.ycombinator.com··Hacker News

RAGAS Belongs at Design Time

🔍RAG Blog

rephrase-it.com·

Show HN: Incremental RAG ingestion, only changed chunks get re-embedded

🔍RAG Code

github.com··Hacker News

Introducing GitLab Orbit

💻Software Engineering Blog

about.gitlab.com··Hacker News

Full Observability for Pinecone: Introducing an Open-Source Monitoring Stack for SaaS and BYOC

🔍RAG Blog

Powering the Inference Era: Inside the DigitalOcean Data & Learning Layer

🔍RAG Blog

digitalocean.com·
