AI Engineering

Feeds to Scour
SubscribedAll
Scoured 2493 posts in 11.7 ms

DiffusionGemma: The Developer Guide- Google Developers Blog

 🧠LLMs  Content type: Blog

Your AI agent reads the fine print: building a RAG pipeline over EU regulations with Elasticsearch and OGX

 🤖AI  Content type: Blog
elastic.co·

Agentic AI vs Generative AI: Why one without the other hits a ceiling

 🧠LLMs  Content type: Blog
udacity.com·

NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet

 🔌APIs

LLM Inference Engineering Room — Part 3: The Orchestration Layer

 🧠LLMs  Content type: Blog

Modernizing attendance ticketing in SAS Viya using SAS Agentic AI Accelerator

 🤖AI  Content type: Blog
blogs.sas.com·

Quiz: Embeddings and Vector Databases With ChromaDB

 🤖AI
realpython.com·

What Is Generative AI?

 🧠LLMs  Content type: Academic
excelsior.edu·

Agentic Hybrid RAG for Evidence-Grounded Muon Collider Analysis

 🤖AI  Content type: Academic
arxiv.org·

Agentic workflows: What they are and how enterprise teams govern them

 🔒AppSec  Content type: Blog
tines.com·

Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG

 🤖AI  Content type: Blog
research.google·

High Bandwidth Flash | A New Memory for AI Data Centers and Edge Computing | Sandisk

 📐System Design
ncnonline.net·

End-to-end encrypted ML inference with Amazon SageMaker AI and FHE

 ☁️Cloud Infrastructure  Content type: Blog
aws.amazon.com·

Build a Medical Report Analyzer on Dedicated Inference with Python

 🧠LLMs
digitalocean.com·

magenta/magenta-realtime: Magenta RealTime 2: An Open-Weights Live Music Model

 🧠LLMs  Content type: Code
github.com·

New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"

 ☁️Cloud Infrastructure  Content type: Discussion

Improved performance and model support with GGUF

 🧠LLMs  Content type: Blog
ollama.com·

Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA

 🚀DevOps
cloudnativenow.com·

Using Scikit-LLM with Open-Source LLMs

 🧠LLMs

Fixing a stuck Ollama runner and building a GPU watchdog

 📊Observability

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help