My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
Model Efficiency
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·19h
Model Efficiency
GPU Pro – Master Your AI Workflow
github.com·1d
Model Efficiency
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·20h
Model Efficiency
ZkML Breakthrough: 13B Models Verified in 15 Minutes
lightcapai.medium.com·1d·
Discuss: Hacker News
LLM Optimization
Automated Anomaly Detection and Self-Calibration in CMUT Array Fabrication via Bayesian Optimization
dev.to·14h·
Discuss: DEV
Model Efficiency
Nvidia GPU Boost: My Stock RTX 5080 Is Consistently Beating Advertised
news.ycombinator.com·2d·
Discuss: Hacker News
Model Efficiency
AMD Will Continue Game Optimization Support For Older Radeon GPU's After All
tech.slashdot.org·1h
Model Efficiency
A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·14h·
Discuss: Substack
Model Efficiency
Torchforge – a PyTorch native library for scalable RL post-training
pytorch.org·4d·
Discuss: Hacker News
Model Efficiency
Dive into Systems
diveintosystems.org·8h·
Discuss: Hacker News
✍️Prompt Engineering
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.com·22h·
Discuss: r/LLM
LLM Optimization
AMD Radeon AI PRO R9700 Offers Competitive Workstation Graphics Performance/Value
phoronix.com·9h·
Discuss: Hacker News
Model Efficiency
The next RISC-V processor frontier: AI
edn.com·3d·
Discuss: Hacker News
Model Efficiency
Show HN: a Rust ray tracer that runs on any GPU – even in the browser
github.com·11h·
Discuss: Hacker News
Model Efficiency
Can-t stop till you get enough
cant.bearblog.dev·1d·
Discuss: Hacker News
🤖AI
Scalable In-Memory Associative Processing for Graph Neural Network Inference
dev.to·1d·
Discuss: DEV
Model Efficiency
Cocoon from Telegram: A Decentralized AI Network That Pays GPU Owners in Crypto
decrypt.co·1d·
Discuss: Hacker News
Model Efficiency
How to Use Multimodal AI Models With Docker Model Runner
docker.com·11h
🤖AI
Digital Twins: the missing pieces we can solve with Machine Learning
quantblog.wordpress.com·20h·
Discuss: Hacker News
🔍AI Interpretability