Hardware Acceleration

Founding Engineer - FPGA, RTL, & ASIC Architect at Zettascale

 Systems Performance

Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA

 🪄Prompt Engineering  Content type: Academic
arxiv.org·

Vortex 3.0 Released As Full-Stack, Open-Source RISC-V GPU Now With 3D Pipeline

 🖥GPUs
phoronix.com·

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 🔢BitNet Inference

Training an LLM in Swift, Part 2: macOS built-in frameworks

 🔄SIMD Programming  Content type: Blog

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

 🤖AI  Content type: Blog
blogs.nvidia.com·

jeffhuen/RustyCSV: High-performance CSV parsing for Elixir. Rust NIF with SIMD acceleration, parallel parsing, and bounded-memory streaming. Drop-in NimbleCSV replacement.

 SIMD Optimization  Content type: Code
github.com··Hacker News

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 🤖AI
smolhub.com··r/LocalLLaMA

Optimizing Local LLM Inference on Constrained Hardware

 🤖AI
pub.towardsai.net·

Alchip Accelerates on AI ASIC Demand

 💻Chips
semiwiki.com·

Building Multi-Agent Systems For ASIC Flows

 💻Chips
semiengineering.com·

Marvell: Why Jensen Huang Wants A $1T Competitor (NASDAQ:MRVL)

 🚀Startups  Content type: News
seekingalpha.com·

NVIDIA Nsight Compute

 🏗️LLM Infrastructure
developer.nvidia.com·

geohot/fromthetransistor: From the Transistor to the Web Browser, a rough outline for a 12 week course

 🧠Memory Management  Content type: Code
github.com··Hacker News

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 🏗️LLM Infrastructure  Content type: Academic
arxiv.org·

Camera for Radio Waves using 4x4 MIMO

 🔬RaBitQ

Why my SIMD code was silently running as scalar, and what debugging it taught me about production environment assumptions

 SIMD Optimization  Content type: Blog

DiffusionGemma: The Developer Guide

 Fast AI Inference  Content type: Blog
developers.googleblog.com·

Making Local LLM Go Brrr

 🤖AI

Programming Domain-Specific FPGA Hardblocks from HLS: An RTL Blackbox Approach

 🖥️Hardware Architecture  Content type: Academic
arxiv.org·
