Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 LLM Inference
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
26924
posts in
690.6
ms
BREAKING🚨:
Stanford
University just
launched
a FREE AI tool for researchers!
threadreaderapp.com
·
3d
🔍
FAISS
Performance
Tip
of the Week #79: Make at most one
tradeoff
at a time
abseil.io
·
4d
⚙️
Mechanical Sympathy
Mastering
Unstructured
data: The
Blueprint
For Efficient Solution
pub.towardsai.net
·
3d
🔤
Tokenization
NVIDIA
VibeTensor
: AI Just Built Its Own Deep Learning Engine… And It Actually Works (AI
Revolution
youtube.com
·
4d
🖥
GPUs
Hardware
Acceleration
jellyfin.org
·
4d
⚡
Hardware Acceleration
Planning Work for Our
Single-Threaded
Brains
linkedin.com
·
4d
🎯
Deep Work
userface.ai
userface.ai
·
4d
🆕
New AI
How
StrongDM
’s AI team build
serious
software without even looking at the code
simonw.substack.com
·
4d
·
Discuss:
Substack
🏗️
LLM Infrastructure
6 AI Agents, One Company
voxyz.space
·
3d
🆕
New AI
Show HN:
LocalGPT
– A local-first AI assistant in Rust with
persistent
memory
news.ycombinator.com
·
4d
·
Discuss:
Hacker News
🔎
Tantivy
LOTFormer
:
Doubly-Stochastic
Linear Attention via Low-Rank Optimal Transport
arxiv.org
·
2d
🕸️
Sparse Vectors
Decomposing
Reasoning
Efficiency
in Large Language Models
arxiv.org
·
1d
🧮
SMT Solvers
Making a Hardware Accelerated Live TV Player from Scratch in C: HLS Streaming,
MPEG-TS
Demuxing
, H.264 Parsing, and Vulkan Video Decoding
blog.jaysmito.dev
·
3d
·
Discuss:
Hacker News
,
r/programming
📄
File Formats
Hallucinations
in
GPT5
– Can models say "I don't know" (June 2025)
jobswithgpt.com
·
4d
·
Discuss:
Hacker News
🚀
Astral
Beyond
agentic
coding
haskellforall.com
·
4d
·
Discuss:
Lobsters
,
Hacker News
,
Hacker News
,
r/programming
👨💻
AI Coding
For real
game-theoretic
reasoning, we need best response in
imperfect
information games
weyxie.bearblog.dev
·
3d
·
Discuss:
Hacker News
🛡️
AI Security
We
recreated
the Anthropic C
compiler
agent
vizops.ai
·
3d
·
Discuss:
Hacker News
⚙️
Language Runtimes
Heterogeneous
Processing: A Strategy for
Augmenting
Moore's Law (2006)
linuxjournal.com
·
4d
·
Discuss:
Hacker News
🖥️
Hardware Architecture
Generative
Modeling
via
Drifting
lambertae.github.io
·
6d
·
Discuss:
Hacker News
📦
Batch Embeddings
EBM
vs. LLMs: Our
Kona
EBM
a 96% vs. 2% Sudoku Benchmark
logicalintelligence.com
·
6d
·
Discuss:
Hacker News
🏆
LLM Benchmarking
Loading...
Loading more...
« Page 23
•
Page 25 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help