Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 LLM Inference
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
26847
posts in
5.34
s
How we cut
Vertex
AI latency by 35% with
GKE
Inference Gateway
cloud.google.com
·
4d
🧠
Inference Serving
Bulk
RRAM
Could Be AI’s Memory Wall Solution
spectrum.ieee.org
·
1d
·
Discuss:
r/hardware
🧠
Memory Hierarchy Design
The LLM Judge
Controversy
mlfrontiers.substack.com
·
2d
·
Discuss:
Substack
🏆
LLM Benchmarking
2026W05
jordivillar.com
·
2d
💾
Persistence Strategies
Using
Chisanbop
with Memory
Palaces
forum.artofmemory.com
·
2d
🏊
Memory Pools
bartowski/moonshotai
_Kimi-Linear-48B-A3B-Instruct-GGUF
huggingface.co
·
1d
·
Discuss:
r/LocalLLaMA
🚀
Astral
Tutorial
on
Agentic
Engine
pori.vanangamudi.org
·
2d
·
Discuss:
r/LocalLLaMA
🛡️
Open Policy Agent
How can
computing
for AI and other
demands
be more energy efficient?
techxplore.com
·
3d
🖥
GPUs
From
Chunks
to
Connections
: The Case for Graph RAG
pub.towardsai.net
·
2d
🔄
LLM RAG Pipelines
The
nature
of LLM
algorithmic
progress
lesswrong.com
·
5d
🏆
LLM Benchmarking
LLMs are Getting a Lot Better and Faster at
Finding
and
Exploiting
Zero-Days
schneier.com
·
1d
🕳
LLM Vulnerabilities
State of AI:
Bi-Annual
Snapshot
iconiqcapital.com
·
1d
🆕
New AI
Reinforcement
Inference
: Leveraging Uncertainty for
Self-Correcting
Language Model Reasoning
arxiv.org
·
1d
🏗️
LLM Infrastructure
Adaptive
Retrieval
helps Reasoning in LLMs -- but
mostly
if it's not used
arxiv.org
·
1d
🔄
LLM RAG Pipelines
Nexus AI – A Chrome extension that
understands
and
summarizes
the page
nexusbrowserai.com
·
1d
·
Discuss:
Hacker News
🧠
Obsidian
World Models and the Data Problem in
Robotics
joeljang.github.io
·
1d
·
Discuss:
Hacker News
✨
Gemini
Oatmeal
-
Constraint
propagation for fun
eli.li
·
3d
·
Discuss:
Lobsters
,
Hacker News
🧮
SMT Solvers
AI
Workflows
with
human-in-the-loop
weavemind.ai
·
3d
·
Discuss:
Hacker News
👨💻
AI Coding
[
RFC
PATCH v1 0/4] Machine Learning (
ML
) library in Linux kernel
lore.kernel.org
·
4d
·
Discuss:
Lobsters
,
Hacker News
🦙
Ollama
Mathematical Resolution of P vs NP through
Informational
Noise
Subtraction
and Linear O(n) Mapping
zenodo.org
·
3d
·
Discuss:
Hacker News
🧮
SMT Solvers
Loading...
Loading more...
« Page 11
•
Page 13 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help