Feeds to Scour
SubscribedAll
Scoured 82625 posts in 407.7 ms
MiniTensor: A Lightweight, High-Performance Tensor Operations Library
arxiv.orgยท1d
๐ŸŽ๏ธTensorRT
Preview
Report Post
Optimizing Tensor Train Decomposition in DNNs for RISC-V Architectures Using Design Space Exploration and Compiler Optimizations
arxiv.orgยท1d
๐ŸŽ๏ธTensorRT
Preview
Report Post
Anthropic's Performance Take-Home: A 65x Optimization (For Dummies)
ikot.blogยท21hยท
Discuss: Hacker News
๐ŸŽ›๏ธCUDA Optimization
Preview
Report Post
Card Achieves 3x Faster Training With Novel Causal Autoregressive Diffusion
quantumzeitgeist.comยท15h
๐Ÿ“ŠGradient Accumulation
Preview
Report Post
Writing an optimizing tensor compiler from scratch
michaelmoroz.github.ioยท3dยท
Discuss: Hacker News
๐ŸŽ๏ธTensorRT
Preview
Report Post
The Better Lesson? Geometry and Topology in the Era of Deep Learning
bastian.rieck.meยท5h
๐ŸŽ๏ธTensorRT
Preview
Report Post
**Abstract:** This paper introduces a novel approach to Neural Architecture Search (NAS) specifically tailored for resource-constrained edge AI vision system...
freederia.comยท4h
โšกONNX Runtime
Preview
Report Post
Writing an Optimizing Tensor Compiler from Scratch
hackaday.comยท4d
๐ŸŽ๏ธTensorRT
Preview
Report Post
A Map of ML Hardware Architectural Trade-Offs
vbml.substack.comยท3dยท
Discuss: Substack
๐ŸŒŠCUDA Streams
Preview
Report Post
Zephyr: Direct Distillation of LM Alignment
dev.toยท1hยท
Discuss: DEV
๐Ÿ› Ml-eng
Preview
Report Post
Silicon coupled with open development platforms drives context-aware edge AI
edn.comยท3h
โšกONNX Runtime
Preview
Report Post
Diffusion LLM Sampling Achieves 70% Latency Reduction With Novel NPU Design
quantumzeitgeist.comยท1d
๐ŸŽ›๏ธCUDA Optimization
Preview
Report Post
AI and the Reconfiguration of the Counterintelligence Battlefield
tandfonline.comยท51m
๐Ÿง BF16
Preview
Report Post
iree-org/wave: Wave: Python Domain-Specific Language for High Performance Machine Learning
github.comยท20hยท
Discuss: Hacker News
๐Ÿ“œTorchScript
Preview
Report Post
WebGPU Cameras
webgpufundamentals.orgยท3h
๐ŸŽฎNVIDIA
Preview
Report Post
Beyond Two Towers: Re-architecting the Serving Stack for Next-Gen Ads Lightweight Ranking Modelsโ€ฆ
medium.comยท1d
โšกONNX Runtime
Preview
Report Post
ML for Energy-Performance-Aware Scheduling On Heterogeneous Multicore Architectures (Cambridge)
semiengineering.comยท1d
๐Ÿ“ˆOccupancy Optimization
Preview
Report Post
PyTorch in 2026: The Complete Guide
dev.toยท19hยท
Discuss: DEV
๐Ÿ“œTorchScript
Preview
Report Post
The Core Flaws of Modern AI based on Large Language Models (longpost)
bykozy.meยท22hยท
Discuss: Hacker News
๐Ÿ“ŠGradient Accumulation
Preview
Report Post
Weight Initialization in Deep Learning: Xavier (Glorot), He (Kaiming), and Beyond
pub.towardsai.net
ยท8h
๐Ÿ“ŠGradient Accumulation
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help