Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ Cache-Aware Algorithms
Memory Hierarchy, Data Locality, Performance Optimization, NUMA
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80061
posts in
1.21
s
ForesightKV
: Optimizing KV Cache
Eviction
for Reasoning Models by Learning Long-Term Contribution
arxiv.org
·
6d
🗺️
Region Inference
Same Engine, Multiple Gears: Parallelizing
Fixpoint
Iteration at Different
Granularities
(Extended Version)
arxiv.org
·
1d
🚀
Code Motion
How to Design LLM
Applications
for Production: A System Design Guide
dev.to
·
1d
·
Discuss:
DEV
🎨
Domain-Specific Languages
Why AI
Assistants
Forget Everything (And How I Fixed It with
SuperLocalMemory
)
dev.to
·
2d
·
Discuss:
DEV
🗺️
Region Inference
Why Real-Time Execution Is Now Expected in
Lakehouse
Architectures
singlestore.com
·
3d
📋
Task Queues
Speeding
Up
HTML
Generation by 2000%
bobrubbens.nl
·
4d
⚡
Incremental Parsing
Writing an LLM from scratch, part
32d
--
Interventions
: adding attention bias
gilesthomas.com
·
3d
·
Discuss:
Hacker News
✨
Effect Inference
Why
oh
why is my application
waiting
?
colinpaice.blog
·
4d
🔮
Branch Predictors
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
·
5d
·
Discuss:
Hacker News
,
r/Compilers
🚀
MLton
Making
Pyrefly
Diagnostics
18x Faster
pyrefly.org
·
4d
·
Discuss:
Hacker News
💬
Interactive REPLs
A general optimization framework for
mapping
local
transition-state
networks
nature.com
·
3d
🗺️
Region Polymorphism
DC
Byte
analysis warns the era of ‘cheap and
abundant
RAM’ is over
kitguru.net
·
5d
🧠
Memory Ordering
The Risk of Not
Optimizing
Clock
Power
semiwiki.com
·
3d
🔮
Speculative Execution
Matching
the right LLM for your GPU feels like an art, but I finally
cracked
it
xda-developers.com
·
2d
📊
Profiling Tools
**Abstract:** This paper proposes a novel framework,
HyperScore-Enabled
Autonomous Anomaly Mitigation (
HEAM
), for real-time anomaly detection and mitigation ...
freederia.com
·
3d
📋
JSON Parsing
When Language Models Get Stuck: The Mechanics of
Repetition
Loops
pub.towardsai.net
·
3d
🦌
ANTLR
llOOPy
lOOPs (Dave
Jarvis
)
dave.autonoma.ca
·
3d
🤐
Zipper Structures
Private Data Space Model
privatedata.space
·
4d
🪢
Rope Data Structures
Import AI 444: LLM
societies
; Huawei makes kernels with AI;
ChipBench
importai.substack.com
·
15h
·
Discuss:
Substack
🪜
Recursive Descent
Engineering
Ethereum
's Speed: How we made
Ethrex
20x faster
blog.lambdaclass.com
·
5d
⚡
Performance
Loading...
Loading more...
« Page 13
•
Page 15 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help