LLMs

Feeds to Scour
SubscribedAll
Scoured 525 posts in 8.3 ms

Embeddings in Generative AI: The Hidden Technology That Makes AI Actually Useful

 🔍Information Retrieval  Content type: Blog
medium.com
·

How to Build an Agentic RAG with RubyLLM and Rails

 🔍Information Retrieval  Content type: Blog
panasiti.me··Hacker News

Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG

 🪟Context Windows  Content type: Blog
research.google·

Why Your LLM Gets Dumber With More Context

 🪟Context Windows
siliconopera.com·

massimo92/spark: CLI tool for serving LLMs with vLLM on NVIDIA DGX Spark. One file, zero friction.

 🧠LLM Inference  Content type: Code
github.com··Hacker News

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

 🧠LLM Inference

Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite

 🪟Context Windows  Content type: Academic
arxiv.org·

An autopsy of Claude Code's deep research

 🤖AI Agents
nibzard.com·

Location: Arlington Heights, IL, USA (Chicago Area) Remote: Yes Willing to reloc...

 🐍Python  Content type: Discussion

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

 CUDA

Introducing the Third Generation of Apple’s Foundation Models

 🤖Machine Learning

Why Most RAG Systems Slow Down After the First 3 Months

 🪟Context Windows  Content type: Blog
blog.stackademic.com
·

Built and launched a research-reading and highlighting tool with Claude over a few months. Here are the things AI was surprisingly good (and bad) at.

 🤖LLM
highlyt.app··r/ClaudeAI

Google's new open-weights model brings image-generation tricks to AI text generation

 🤖Data science  Content type: News
theregister.com·

New comment by Ayaz_Saifi in "Ask HN: Who wants to be hired? (June 2026)"

 🤖Machine Learning

#068 - Apple runs Siri on Google's Gemini, OpenAI files a secret IPO at $852B, Xiaomi clocks 1,000 tps

 🧠LLM Inference
indiehacker.news·

An AI-Powered Trisomy 21 Research Assistant

 🪟Context Windows  Content type: Academic
biorxiv.org·

You Probably Don’t Need a Vector Database - If Your Data Already Lives in BigQuery

 🪟Context Windows  Content type: Blog
medium.com
·

Your AI agent reads the fine print: building a RAG pipeline over EU regulations with Elasticsearch and OGX

 🔍Information Retrieval  Content type: Blog
elastic.co·

Takeway from AWS Generative AI Lens

 🤖AI Agents  Content type: Reference
docs.aws.amazon.com··DEV

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help