Toward Better Hip Kernel Generation for AMD GPUs (opens in new tab)
--- ## TLDR In this work, we explore how to make language models better at generating high performance **HIP kernels** for **AMD GPUs**\. We present the following: 1. A **synthetic dataset** of 500 new PyTorch reference tasks using **mutation**, **composition**, and **constraint-based generation** to cover a broader range of workloads\. 1. A **multi-agent optimization pipeline** for HIP kernel generation\. Instead of relying on single-shot prompting, we used specialized agents for tas...
Read the original article