Hardware Acceleration

Feeds to Scour
SubscribedAll
Scoured 539 posts in 15.5 ms

Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026

 🔥PyTorch  Content type: Blog
runaihome.com··DEV

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🖥computers
phoronix.com·

Founding Engineer - FPGA, RTL, & ASIC Architect at Zettascale

 🔌FPGA

Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA

 🔌FPGA  Content type: Academic
arxiv.org·

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

 🔥Burn
openjdk.org··r/java

Exploring the Classic Xilinx XC5202-6PQ100I FPGA

 🔌FPGA
hackster.io·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 🔥PyTorch  Content type: Code
github.com··Hacker News

Rethinking the Logic-Routing Tradeoff in FPGAs

 🔌FPGA  Content type: News
eetimes.com·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

 🏗️AI Infrastructure  Content type: Blog
jimmysong.io·

NVIDIA Nsight Compute

 🌟Ray Tracing
developer.nvidia.com·

Asics running shoes are up to 42% off — I’ve handpicked 7 deals on shoes I’ve tested and recommend

 👔Men's Fashion
tomsguide.com
·

Agilex 9 FPGAs power COTS VPX boards

 🔌FPGA
edn.com·

Deep X XM2 NPU: 80 TOPS Generative AI Accelerator at 5W

 📡Edge Computing
armdevices.net·

Programming Domain-Specific FPGA Hardblocks from HLS: An RTL Blackbox Approach

 🔌FPGA  Content type: Academic
arxiv.org·

CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels

 🏗️AI Infrastructure
phoronix.com·

Communication Strategy Selection for Multi-GPU 3D FDTD with Convolutional Perfectly Matched Boundary Layers

 🌟Ray Tracing  Content type: Academic
arxiv.org·

AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference

 🔥PyTorch  Content type: Academic
arxiv.org·

FlexNPU: Transparent NPU Virtualization for Dynamic LLM Prefill-Decode Co-location

 🤖AI Inference  Content type: Academic
arxiv.org·

Modeling, Optimizing and Exploring Multi-Die FPGA Routing Architectures

 🔌FPGA  Content type: Academic
arxiv.org·

LLM-Based Porting of Optimized C++ to CUDA Through Deoptimization and Reoptimization

 🏗️AI Infrastructure  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help