📊 Columnar Engines - widget101 · Scour

Guney-olu/nanoslg: A from-scratch implementation of distributed LLM inference in simple readable Python

github.com·4d·

Discuss: Hacker News, r/LLM

📈Performance Profiling

Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon

zenodo.org·20h·

Discuss: r/programming

⚙️Query Compilers

OpenAI’s new Codex Spark model is built for speed

thenewstack.io·1d

Concurrent vs. Parallel Execution in LLM API Calls: From an AI Engineer’s Perspective

pub.towardsai.net·4d

🔄Concurrency

Vector Databases Explained: Architecture and System Design for AI Apps

dev.to·4d·

Discuss: DEV

🧭Vector Databases

BalatroBench Benchmarks Large Language Models Playing Balatro

balatrobench.com·16h·

Discuss: Hacker News

InfraBuilder: The Deterministic Hardware Architect

dev.to·5d·

Discuss: DEV

🏛️Lakehouse Architecture

Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)

linuxjournal.com·5d·

Discuss: Hacker News

⚡SIMD Optimization

AFMTJ Model For In-Memory Computing (University of Arizona)

semiengineering.com·3d

💾Cache Optimization

Building a Production-Ready Claude Streaming API with Next.js Edge Runtime

bydaewon.gumroad.com·4d·

Discuss: DEV

🌊Apache Flink

How Andrej Karpathy Built a Working Transformer in 243 Lines of Code

analyticsvidhya.com·1d

⚙️Query Compilers

OFP’s data server killers aiming for AI system scalability and efficiency nirvana

blocksandfiles.com·4d

🏛️Lakehouse Architecture

(Early Stage) Heterodox Analytical Processing Engine Utilizing Tinygrad

github.com·5d·

Discuss: Hacker News, Hacker News

AI in Multiple GPUs: Point-to-Point and Collective Operations

towardsdatascience.com·14h

🔄Concurrency

OpenAI dishes out its first model on a plate of Cerebras silicon

theregister.com·1d

🏗data engineering

Definitive Guide to Multi-Threaded Rendering on the Web

hackernoon.com·6d

Supercharging Inference for AI Factories: KV Cache Offload as a Memory-Hierarchy Problem

blog.min.io·1d

🏗️Hardware Architecture

Show HN: GPU ROI simulator based on token usage and model architecture

axiomos.ai·3d·

Discuss: Hacker News

📈Performance Profiling

Optimizing the MongoDB Java Driver: How minor optimizations led to macro gains

linkedin.com·2d·

Discuss: DEV

💾Cache Optimization

Memgraph 3.8 is Out: Atomic GraphRAG + Vector Single Store With Major Performance Upgrades

memgraph.com·1d·

Discuss: Hacker News

📈Performance Profiling

Loading more...