Cache Optimization

Feeds to Scour
SubscribedAll
Scoured 36 posts in 15.0 ms

Apple Chip Architecture from 1977 to 2026

 SIMD  Content type: News  Content type: Blog

HFT Latency Monitoring with Probabilistic Calling Context

 SIMD

bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss

 🖥️HPC  Content type: Code
github.com··r/LocalLLaMA

[Dev Weekly #114] Google’s Gemma 4 Changes the Game | Ruby Performance Secrets Exposed | Trust Over Velocity - The Miners

 🎨Generative Art  Content type: Blog
blog.codeminer42.com·

HP has slashed an astonishing $2,600 off this RTX 5080 gaming PC, nearly 50% off — get an epic Omen 35L rig with a 9900X3D, 64GB DDR5, and 4TB of SSD storage for just $2,899.99

 🖥️HPC
tomshardware.com
·

RATrain: A Resource-Aware Training Runtime for Large Language Models on Bandwidth-Constrained Heterogeneous Supercomputing Platforms

 🖥️HPC  Content type: Academic
arxiv.org·

The Hidden Cost of Records: When Java’s Immutable Data Classes Quietly Hurt Your GC

 🌳B-Trees
javacodegeeks.com·

Release Notes J9.7 - J Wiki

 SIMD

Issue #390 - The ML Engineer 🤖

 🧬k-mer Analysis  Content type: News  Content type: Blog

AGENTSERVESIM: A Hardware-aware Simulator for Multi-Turn LLM Agent Serving

 🖥️HPC  Content type: Academic
arxiv.org·

Tuning SCHED_BATCH for Non-Interactive, CPU-Bound Workloads

 Concurrency  Content type: News  Content type: Blog

The copy_if Speedup That Wasn't About copy_if, Or AVX-512

 SIMD

Passing DBs Through Continuations

 🌳B-Trees  Content type: Blog

jianzhichun/permafrost: Freeze Claude Code's prompt prefix so DeepSeek's automatic cache always hits — alignment proxy + coalescing + keepalive, installable as a CC plugin. Measured 64% cheaper on real Claude Code traffic.

 🌸Bloom Filters  Content type: Code
github.com··Hacker News

I built an ECS framework using C++26 static reflection features.

 Concurrency  Content type: Code
github.com··r/cpp

franz1981/Netty-VirtualThread-Scheduler: A novel integration between Netty and Virtual Threads

 Concurrency  Content type: Code
github.com··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help