⚡ Cuda - miterion · Scour

AI in Multiple GPUs: Understanding the Host and Device Paradigm

towardsdatascience.com·6h

⏱️CUDA Events

datavorous/spheni: An in-memory vector search library in C++ with Python bindings

github.com·1d·

Discuss: Hacker News

GPU-Fuzz: Finding Memory Errors in Deep Learning Frameworks

arxiv.org·14h

Show HN: Solving Sudoku reasoning via Energy Geometric models

davisgeometric.com·10h·

Discuss: Hacker News

Building a Zero-Dependency secp256k1 CUDA Engine from Scratch (2.5B ops/SEC)

github.com·1d·

Discuss: Hacker News

Implementing 3D Graphics Basics

hackaday.com·16h

BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization

arxiv.org·14h

gist.github.com·21h·

Discuss: Hacker News, Hacker News

📉Model Quantization

Two Ways to Move Tensors Without Stopping: Inside vLLM's Async GPU Transfer Patterns

dev.to·22h·

Discuss: DEV

🌊CUDA Streams

NVIDIA GeForce NOW Turns Screens Into a Gaming Machine

elevenforum.com·4h

A C implementation of the inference pipeline for the Mistral AI’s Voxtral Realtime 4B model

blog.adafruit.com·2h

🏎️TensorRT

NVIDIA DGX Spark Powers Big Projects in Higher Education

blogs.nvidia.com·4h

Ming-flash-omni-2.0: 100B MoE (6B active) omni-modal model - unified speech/SFX/music generation

huggingface.co·1h·

Discuss: r/LocalLLaMA

⚡Flash Attention

The Efficiency Wall: Why the Next 1,000x Leap Isn’t More GPUs

pub.towardsai.net

·15h

🌊CUDA Streams

EGPU Enclosures support

lemmy.ml·22h

📈GPU Occupancy

Nvidia RTX 6000D teardown shows 84GB VRAM using 3GB memory chips

club386.com·2h

📈GPU Occupancy

Porting an INT8 VHDL CNN from Intel Agilex 3 to Lattice Certus-NX

news.ycombinator.com·6h·

Discuss: Hacker News

What Agentic AI "Vibe Coding" In The Hands Of Actual Programmers / Engineers

stochasticlifestyle.com·7h

CPU cloth simulation performance comparable to GPU SotA

sig25ddmpd.github.io·18h·

Discuss: Hacker News

How Programmers Spend Their Time

probablydance.com·1d·

Discuss: Hacker News

⚡Flash Attention

Loading more...