We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.comยท3dยท
Discuss: Substack
๐Ÿ“‹Tokei
Flag this post
The True Cost of AI Integrations: Comparing Performance and Pricing Models for C# Libraries
dev.toยท3dยท
Discuss: DEV
๐Ÿ‘๏ธObservability
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.comยท4dยท
๐Ÿค–AI
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.ioยท4dยท
Discuss: Hacker News
๐Ÿ“ˆPerformance Profiling
Flag this post
Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.comยท5dยท
Discuss: Substack
๐Ÿค–AI
Flag this post
Why stop at 1 million tokens when you can have 10? My journey to extreme context on a gaming GPU. [P]
reddit.comยท2dยท
๐Ÿ—data engineering
Flag this post
The next RISC-V processor frontier: AI
edn.comยท6dยท
Discuss: Hacker News
๐Ÿ—๏ธHardware Architecture
Flag this post
Building Real-Time ML Feature Pipelines with Streaming SQL
timeplus.comยท1dยท
Discuss: Hacker News
โฑ๏ธReal-time Analytics
Flag this post
Read more
yugabyte.comยท3dยท
Discuss: Hacker News
๐ŸงญVector Databases
Flag this post
Low-Level Hacks
blog.raycursive.comยท3dยท
Discuss: Hacker News
๐Ÿฆ€Rust Scientific
Flag this post
Geonum โ€“ geometric number library for unlimited dimensions with O(1) complexity
github.comยท3dยท
Discuss: Hacker News
๐Ÿ”ขNumPy
Flag this post
ZkML Breakthrough: 13B Models Verified in 15 Minutes
lightcapai.medium.comยท4dยท
Discuss: Hacker News
๐ŸงŠIceberg Tables
Flag this post
Perplexity shows how to run monster AI models more efficiently on aging GPUs, AWS networks
theregister.comยท1d
โšกDataFusion
Flag this post
Why is AI Generated Rust slow when compared with Go/C#/Node/JavaScript
srid68.github.ioยท2dยท
Discuss: Hacker News
๐Ÿ“‹Tokei
Flag this post
Scalable In-Memory Associative Processing for Graph Neural Network Inference
dev.toยท4dยท
Discuss: DEV
๐Ÿ—๏ธHardware Architecture
Flag this post
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example )
nbtab.comยท3dยท
Discuss: DEV
โš™๏ธQuery Compilers
Flag this post
Planning > Agents: Getting Reliable Code from LLMs
repoprompt.comยท2dยท
Discuss: Hacker News
๐Ÿ”AI Detection
Flag this post
Balancing Cost, Power, and AI Performance
oreilly.comยท2d
๐Ÿ“ŠApproximate Computing
Flag this post
Generation at the Speed of Thought: Speculative Decoding
bittere.substack.comยท4dยท
Discuss: Substack
๐Ÿ“ŠApproximate Computing
Flag this post