Data Locality

Feeds to Scour
SubscribedAll
Scoured 20 posts in 30.8 ms

RADV Driver Now Leveraging RDNA3+ Hardware Feature For Better Instruction Cache Prefetching

 🔓Open Source Software
phoronix.com·
Less-relevant results

Beyond the Memory Wall: The CPU Was Helping You All Along

 💨Cache-Friendly Algorithms  Content type: Blog
prawns.dev··Hacker News

Making Locality-aware GEMM Compatible with Page-Granularity Placement on Chiplet GPUs

 🤖AI  Content type: Academic
arxiv.org·

The Return of Rigorous Full-System Timing Simulation

 🧠Memory Hierarchy Design
sigarch.org··Hacker News

coherentforge/CambiOS: Zero-trust, capability-based Rust microkernel targeting formal verification. Tri-arch (x86_64 / AArch64 / RISC-V). Sovereign and generative: no telemetry, user owns keys and data. Early-stage — see STATUS.md. Inspired by seL4, Hubris, and Redox.

 🧠Memory Management  Content type: Code
github.com··Hacker News

Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design

 Fast AI Inference  Content type: Blog

Prompt Caching on Claude: Cut Input Costs 78% (The Math Nobody Writes Down)

 💻Claude Code
pub.towardsai.net
·

A Database You Can See

 ⚙️Mechanical Sympathy  Content type: Blog
nockawa.github.io·

Why are cached input tokens cheaper with AI services?

 🇨🇳Chinese AI
xeiaso.net·

Neural Field Tokenizations with Hierarchy and Spatial Locality Priors

 🔍Search Indexing  Content type: Academic
arxiv.org·

pylit-ai/opendream: Local-first memory and dreaming automation for agents.

 💻Claude Code  Content type: Code
github.com··Hacker News

Why Your CPU Is Fast but Your Program Is Slow: Understanding the Memory Wall

 🧠Memory Hierarchy Design  Content type: Blog
prawns.dev··Hacker News

DuckDB Storage Engine for MariaDB. When the Sea Lion Learns to Quack.

 🦆DuckDB
mariadb.org··Hacker News

Spatially Masked Regression Reveals Local and Distributed Predictability in Electrophysiological Recordings

 ☢️Nuclear Energy  Content type: Academic
arxiv.org·

A Case for Simulation-Driven Resilience in Agentic Data Systems

 💳AI Commerce  Content type: Blog

HFT Latency Monitoring with Probabilistic Calling Context

 🔬Rust Profiling

Interlude: Spectrum vs. C64 BASIC Showdown

 Zero-Copy Serialization  Content type: Blog

ASTRA-sim 3.0: Next-Level Distributed Machine Learning Simulations via High-Fidelity GPU and Infrastructure Modeling

 🖥GPUs  Content type: Academic
arxiv.org·

bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss

 🤖AI  Content type: Code
github.com··r/LocalLLaMA

jianzhichun/permafrost: Freeze Claude Code's prompt prefix so DeepSeek's automatic cache always hits — alignment proxy + coalescing + keepalive, installable as a CC plugin. Measured 64% cheaper on real Claude Code traffic.

 🔌Claude Plugins  Content type: Code
github.com··Hacker News

No more posts from emschwartz's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help