Burn

Rust Deep Learning, Tensor Operations, WGPU Backend, Model Training, Type-safe ML

Feeds to Scour
SubscribedAll
Scoured 35 posts in 14.0 ms

I stopped using most of Rust’s advanced features for my ML library

 🔥PyTorch  Content type: Code
github.com··r/rust

Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change

 💻Local LLMs  Content type: News  Content type: Blog

TensorBench: Benchmarking Coding Agents on a Compiler-Based Tensor Framework

 🔥PyTorch  Content type: Academic
arxiv.org·

Is Doom a Tensor? [video]

 🔥PyTorch  Content type: Video
youtube.com··Hacker News

How we fight GPU scarcity without compromise

 🤖AI Inference  Content type: Blog
equixly.com··Hacker News

A system programmer’s guide to LLM inference

 💻Local LLMs  Content type: Blog

lbj96347/nemotron-3.5-asr-ios: On-device, offline speech recognition for iPhone/iPad using NVIDIA's Nemotron-3.5-ASR Streaming 0.6B (multilingual) via CoreML.SwiftUI app with mic capture + audio file import, RNN-Tdecoding, and live benchmark metrics (latency, RTF, memory).

 🎙️Whisper  Content type: Code
github.com··Hacker News

Google reportedly orders at least three million chips from Intel to arrive in 2028, as TSMC struggles to keep up with the AI boom

 🖥computers  Content type: News
pcgamer.com
··Hacker News

Tensor Shapes in Pyrefly – Avik Chaudhuri – PyCon US 2026 Typing Summit [video]

 🔥PyTorch  Content type: Video
youtube.com··Hacker News

Apple rebuilt its on-device AI stack at WWDC 2026

 💻Local LLMs  Content type: Blog
ziraph.com··Hacker News

Tensor Algebraic Property Skeletons: Amplifying Property-Based Testing for AI Compilers

 🔨Compilers  Content type: Academic
arxiv.org·

The Edge LLM Offload Story

 Performance
semiengineering.com·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI Inference  Content type: Code
github.com··Hacker News

Integrate on-device AI models into your app using Core AI - WWDC26 - Videos

 🌟cool github projects

Florian Brand, Prime Intellect research engineer, adopts Gemma 4 E4B 6-bit quantized as his primary local Mac LLM

 🏗️AI Infrastructure  Content type: News
digg.com··Hacker News

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 💻Local LLMs

Unpacking AI: The Hardware Behind AI

 Hardware Acceleration  Content type: News

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 💻Local LLMs  Content type: News  Content type: Blog
blog.google··Hacker News

Higgs Audio v3 TTS 4B. Built for voice chat. Support 100 languages and inline control.

 🗣️Speech Synthesis

Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

 🤖AI agents  Content type: Blog
ziraph.com··Hacker News

No more posts from nmarshall's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help