🏎️ TensorRT - miterion · Scour

Cerebras: The $56.4 Billion IPO Challenging NVIDIA’s Memory Wall ⚡Flash Attention

artificialintelligencemadesimple.com·2d

Unleashing the Power of ONNX for Speedier SBERT Inference 🔄ONNX

towardsai.net·2d

AMD makes FSR 4 upscaling official for Radeon RX 7000- and 6000-series cards — RDNA 3 and RDNA 2 chips will soon enjoy improved visuals 🎮NVIDIA

tomshardware.com·1w

Instant GPU Efficiency Visibility at Fleet Scale ⏱️CUDA Events

TFLite Model Conversion: 10 Commands That Actually Work 📉Model Quantization

tildalice.io·3d

kouhxp/yapsnap: Snap any video URL or audio file into plaintext. No GPU. No cloud. One command. 🔓Open-source

github.com·19h·Hacker News

Google Tensor SDK Beta with LiteRT 🎯Tensor Cores

developers.googleblog.com·1d

AMD Confirms FSR 4.1 Support for Radeon RX 7000 in July, RX 6000 GPUs Get it in 2027 🔍Nsight

gizchina.com·6d

ADI to Acquire IVR Tech to Join Data Center’s Power Gold Rush 🔧PTX

eetimes.com·2d

Show HN: FlashAttention-2 in Cute, from Scratch ⚡Flash Attention

blog.echen.io·3d·Hacker News

PyTorch Triton Kernel Transparent Tracing and Compilation ⚡torch.compile

leimao.github.io·17h

Training a 22MB prompt injection classifier 📊Gradient Accumulation

stackone.com·1d·Hacker News

AMD FSR 4.1 Coming to RDNA 2 Will Benefit Xbox Series X More Than PS5 Due to Hardware, SDK – Rumor 🔧PTX

gamingbolt.com·6d

Token-Space Mask Prediction for Efficient Vision Transformer Segmentation 🧩Attention Kernels

Notes on pretraining parallelisms and failed training runs. ⏱️CUDA Events

dwarkesh.com·4d·Hacker News

Understanding KV Cache: The Hidden Memory Cost of Serving LLMs ⚡Flash Attention

melchi.me·2d·Hacker News

An LLM on a Sony PSP ⚙️Systems Programming

·5d

Coding Agent Inference Benchmark Revealed ⚡ONNX Runtime

startuphub.ai·1d

Ollama vs vLLM vs llama.cpp: Which Wins for Your Use Case 📊Profiling Tools

tildalice.io·5d

AMD's FSR 4 coming to RDNA 2 could give the Xbox Series X a PS5 Pro-like upgrade 🔧PTX

tweaktown.com·6d
