🔲 ML Hardware - nickyfoto · Scour

Characterization of machine learning compilers for LLM inference on NVIDIA GPUs 🧠LLMs

link.springer.com·4d·Hacker News

The Model Parking Tax: Quantifying the Hidden Energy Cost of Always-On GPU Model Deployment 🤖AI Research

CUDA 13.3 Lands, AI Writes Blackwell Kernels, & FP4 VRAM Optimization for LLMs ⚡Performance Engineering

dev.to·12h·DEV

Show HN: cuSBF – Faster GPU Bloom Filter for Sequence Data ⚡Performance Engineering

github.com·16h·Hacker News

Build high-performance generative AI systems with Strands Agents, NVIDIA NIM, and Amazon Bedrock AgentCore 🕵️AI Agents

aws.amazon.com·1d

I Made Local AI Faster Than the Cloud — A Complete Home Automation Voice Control Journey ⚡Performance Engineering

linkedin.com·50m·DEV

Running Flux Schnell (12B) + LLMs on a Legacy AMD RX 580 (8GB) via Native Vulkan — Full Architecture Guide [2026] ⚡Performance Engineering

setup-ia-local-rx580-vulkan.firebaseapp.com·5d·DEV

Dense vs MoE Models Explained 🧠LLMs

engineersmeetai.substack.com·22h·Substack

AI Datacenters Were Built for GPUs. What Happens When You Remove the GPUs? 🏗️System Design

almartis.xyz·2d·Hacker News

Argonne flexes spare supercompute to build private AI inference service 🤖AI Research

theregister.com·13h·Hacker News

AI Infrastructure Preflight at User space: Validating Multi Node, Multi GPU Slurm Clusters ⚡Performance Engineering

techcommunity.microsoft.com·5d

Not All On-Device AI Is The Same: How Chip Compute Tiers Decide What Your Product Can Actually Do 🔌Embedded Systems

easelinktech.com·2d·Hacker News

The future of AI is an AI futures market 🤖AI Research

·1d·Hacker News

Nvidia bets $150B on Taiwan as Trump's plan to make US an AI hub backfires ⚖️Tech Policy

arstechnica.com·13h

Getting Started with Slinky on DigitalOcean Kubernetes ☁️Cloud Computing

digitalocean.com·6d

Presentation: Designing AI Platforms for Reliability: Tools for Certainty, Agents for Discovery 🤖AI Research

·1d

NVIDIA Removes Gaming Revenue Category From Financial Reports ⚖️Tech Policy

guru3d.com·6d·Hacker News, r/LocalLLaMA

The Download: keeping up with AI, and the future of IVF 🤖AI Research

technologyreview.com·16h·Hacker News

The Open/Closed Problem in AI 🤖AI Research

blog.mempko.com·5d·Lobsters, Hacker News

openbmb/MiniCPM5-1B 🧠LLMs

huggingface.co·2d·r/LocalLLaMA

Log in to enable infinite scrolling