CUDA Templates, Matrix Multiplication, Linear Algebra Primitives, Tensor Operations

Lazy loading isn't the magic pill to fix AI Inference
tensorfuse-docs.mintlify.dev·14h·
Discuss: Hacker News
🚀MLOps
Flag this post
How to Design Efficient Memory Architectures for Agentic AI Systems
pub.towardsai.net·9h
Flash Attention
Flag this post
Benchmarking the cost of Java's EnumSet - A Second Look
kinnen.de·9h·
Discuss: r/programming
⏱️Benchmarking
Flag this post
News for October 2025
ptreview.sublinear.info·1d
🔄ONNX
Flag this post
Tetris: An SLA-aware Application Placement Strategy in the Edge-Cloud Continuum
arxiv.org·23h
🌐Distributed Computing
Flag this post
We hit some annoying gaps with ResourceQuota + GPUs, so HAMi does its own quota pass
reddit.com·17h·
Discuss: r/kubernetes
📈GPU Occupancy
Flag this post
How Transformer Models Detect Anomalies in System Logs
hackernoon.com·1d
📊Gradient Accumulation
Flag this post
Finding Non-Redundant Simpson's Paradox from Multidimensional Data
arxiv.org·23h
🔄ONNX
Flag this post
When numbers lie: the Java equality bug every dev hits at least once
dev.to·2h·
Discuss: DEV
🔬Static Analysis
Flag this post
Predicting & Mitigating Data Corruption in Pure Storage Flash Arrays via Adaptive Bit Error Rate Modeling
dev.to·17h·
Discuss: DEV
⏱️Benchmarking
Flag this post
New comment by xfalcox in "The Case Against PGVector"
github.com·1d·
Discuss: Hacker News
🐕Ruff
Flag this post
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.com·1d·
Discuss: r/cpp
🏎️TensorRT
Flag this post
I built a small ARM-like virtual system with a custom RTOS and C/C++ toolchain (BEEP-8)
reddit.com·1d·
Discuss: r/embedded
🧠CPU Architecture
Flag this post
Simple rule of thumb for deciding code architecture?
reddit.com·1d·
Discuss: r/godot
🔄ONNX
Flag this post
Computer Science Fundamentals: From Binary Systems to Algorithms
dev.to·12h·
Discuss: DEV
⚙️Systems Programming
Flag this post
The case against pgvector
simonwillison.net·1d
🐕Ruff
Flag this post
FairAIED: Navigating Fairness, Bias, and Ethics in Educational AI Applications
arxiv.org·23h
🛠Ml-eng
Flag this post
Diagnosing Hallucination Risk in AI Surgical Decision-Support: A Sequential Framework for Sequential Validation
arxiv.org·23h
🔄ONNX
Flag this post
Wave-Particle (Continuous-Discrete) Dualistic Visual Tokenization for Unified Understanding and Generation
arxiv.org·23h
🧩Attention Kernels
Flag this post
The Art of the Meta: A Journey into JavaScript Proxies
dev.to·1d·
Discuss: DEV
📜TorchScript
Flag this post