🟢 CUDA - nayyara.airlangga

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

🔢GEMM Optimization Code

github.com··Hacker News

NVIDIA Confidential Computing to Help Expand Apple’s Private Cloud Compute

🎮GPU Computing Blog

blogs.nvidia.com·

NVIDIA Nsight Compute

🎮GPU Computing

developer.nvidia.com·

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

🎮GPU Computing Academic

arxiv.org··Hacker News

Less-relevant results

how to make brave use nvidia gpu on ubuntu?

⚗️Kernel Fusion

lemmy.ml·

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

🧠Inference Engineering

alternativeto.net·

Nvidia RTX Spark: The $2,900 Floor Tells You Everything

🎮GPU Computing Blog Discussion

tildalice.io·

Google Pays SpaceX $920M/Month for AI Compute (4 minute read)

💰Inference Cost

winbuzzer.com·

Google-SpaceX $30B Compute Deal Raises Cloud Buyer Questions

☁️Cloud Infrastructure

techrepublic.com·

AMD Radeon RX 9070 GRE vs. Nvidia GeForce RTX 5070

🎮GPU Computing

club386.com·

Particle: SpaceX Discloses $920 Million‑a‑Month Google Compute Deal Ahead of IPO

⚗️Kernel Fusion News

particle.news·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

💰Inference Cost Blog

jimmysong.io·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

💾KV Cache Code

github.com··Hacker News

Apple WWDC Announces Privacy Leap Off a Cliff

⚡FlashAttention

flyingpenguin.com·

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

🎮GPU Computing Academic

arxiv.org·

Apple extends Private Cloud Compute to third-party data centers

☁️Cloud Infrastructure

helpnetsecurity.com·

SpaceX to lease 110,000 Nvidia GPUs from Google in a $920 million per month cloud agreement

⚗️Kernel Fusion News

digg.com·

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels

Exploiting GPU Tensor Cores from Java using Babylon

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

NVIDIA Confidential Computing to Help Expand Apple’s Private Cloud Compute

NVIDIA Nsight Compute

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

how to make brave use nvidia gpu on ubuntu?

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

Nvidia RTX Spark: The $2,900 Floor Tells You Everything

Google Pays SpaceX $920M/Month for AI Compute (4 minute read)

Google-SpaceX $30B Compute Deal Raises Cloud Buyer Questions

AMD Radeon RX 9070 GRE vs. Nvidia GeForce RTX 5070

Particle: SpaceX Discloses $920 Million‑a‑Month Google Compute Deal Ahead of IPO

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

Apple WWDC Announces Privacy Leap Off a Cliff

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

Apple extends Private Cloud Compute to third-party data centers

SpaceX to lease 110,000 Nvidia GPUs from Google in a $920 million per month cloud agreement