PyTorch

Feeds to Scour
SubscribedAll
Scoured 229 posts in 14.1 ms

Location: Edmonton, Canada Remote: Yes Willing to relocate: Yes, within Canada T...

 📱Edge AI  Content type: Discussion

AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference

 📱Edge AI  Content type: Academic
arxiv.org·

A system programmer’s guide to LLM inference

 👁Vision Language Model  Content type: Blog

Benchmarking dots.tts on Strix Halo

 📱Edge AI
sleepingrobots.com·

Apple rebuilt its on-device AI stack at WWDC 2026

 📱Edge AI  Content type: Blog
ziraph.com··Hacker News

The Transformer, Demystified — Let's Actually Build One

 👁Vision Language Model  Content type: News
mlwhiz.com
·

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

 📱Edge AI  Content type: Blog
blogs.nvidia.com·

Unsloth Gemma 4 QAT

 📱Edge AI
unsloth.ai·

Stop hand-tuning kernels: How Neuron Agentic Development accelerates AWS Trainium optimizations

 📱Edge AI  Content type: Blog
aws.amazon.com·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 📱Edge AI  Content type: Code
github.com··Hacker News

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

 📱Edge AI  Content type: Academic
arxiv.org··Hacker News

Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution

 👁Vision Language Model  Content type: News  Content type: Blog

Evaluating and developing machine learning models: äN introduction (gpn24)

 📱Edge AI
cdn.media.ccc.de·

Build a local voice agent with Red Hat OpenShift AI

 👁Vision Language Model
developers.redhat.com·

WSL 3 will finally let Linux apps use your GPU and NPU without the performance tax

 📱Edge AI
xda-developers.com·

New comment by Revanthkodati in "Ask HN: Who wants to be hired? (June 2026)"

 🤖AI

Unreleased RTX 3050 Ti engineering sample appears in photos and benchmarks — the RTX 3060 alternative that never happened

 📱Edge AI  Content type: News
tomshardware.com
·

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 📱Edge AI  Content type: Academic
arxiv.org·

Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU

 📱Edge AI  Content type: Code
github.com··Hacker News

Job Searcher

 📱Edge AI  Content type: Blog
huggingface.co·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help