Hardware Acceleration

Feeds to Scour
SubscribedAll
Scoured 79 posts in 15.1 ms

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 🔥PyTorch  Content type: Code
github.com··Hacker News

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 🔌FPGA

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

 🏗️AI Infrastructure  Content type: Academic
arxiv.org··Hacker News

PCIe Benefits From AI, Despite Scaling Protocols

 🤖AI agents
semiengineering.com·

Nvidia's RTX Spark is a developer's dream, but AMD's Ryzen AI Max+ is what most people actually need for local AI

 🖥computers
xda-developers.com·

Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves

 🎨Shader Programming

Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec

 🔥PyTorch  Content type: Code
github.com··Hacker News

Google reportedly orders at least three million chips from Intel to arrive in 2028, as TSMC struggles to keep up with the AI boom

 🖥computers  Content type: News
pcgamer.com
··Hacker News

Jensen Huang Just Called This the Next Trillion-Dollar AI Chip Stock

 🏗️AI Infrastructure
finance.yahoo.com·

geohot/fromthetransistor: From the Transistor to the Web Browser, a rough outline for a 12 week course

 🔌FPGA  Content type: Code
github.com··Hacker News

Mid-range GPUs have largely dodged the memory crisis, but not for much longer

 🌟Ray Tracing
xda-developers.com·

I stopped using most of Rust’s advanced features for my ML library

 🔥PyTorch  Content type: Code
github.com··r/rust

🫧 AI Companies' Shared Destiny Recalls Dot-Com Bubble Memories

 🏗️AI Infrastructure  Content type: Discussion

Unpacking AI: The Hardware Behind AI

 🧠AI  Content type: News

EP217: Latency vs Throughput vs Bandwidth

 🏗️System Design  Content type: News  Content type: Blog
blog.bytebytego.com·

AMD shipped Nvidia's new AI laptop over a year ago, and the software is finally catching up

 🌟Ray Tracing
xda-developers.com·

LLM-Based Porting of Optimized C++ to CUDA Through Deoptimization and Reoptimization

 🏗️AI Infrastructure  Content type: Academic
arxiv.org·

The Edge LLM Offload Story

 Performance
semiengineering.com·

zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability

 💻Local LLMs  Content type: Code
github.com··Hacker News

CodegenBench: Can LLMs Write Efficient Code Across Architectures?

 🔥PyTorch  Content type: Academic
arxiv.org··Hacker News

No more posts from nmarshall's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help