Optimal Control Theoretic Neural Optimizer: From Backpropagation to Dynamic Programming
arxiv.org·2w
📐Linear Algebra
Flag this post
(Forward) automatic implicit differentiation in Rust with num-dual 0.12.0
reddit.com·3w·
Discuss: r/rust
🦀Rust
Flag this post
Yes, you should understand backprop (2016)
karpathy.medium.com·4d·
Discuss: Hacker News
📐Linear Algebra
Flag this post
Topographical sparse mapping: A training framework for deep learning models
sciencedirect.com·1d·
Discuss: Hacker News
📐Linear Algebra
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·3d·
Discuss: Substack
SIMD
Flag this post
Generalization Below the Edge of Stability: The Role of Data Geometry
arxiv.org·2w
📐Linear Algebra
Flag this post
Neural Networks from Scratch in Python: Simpler Than You Think
hamza.se·3w·
Discuss: Hacker News
📐Linear Algebra
Flag this post
When Transformers Sing: Adapting SpectralKD for Text-Based Knowledge Distillation
towardsdatascience.com·1w
📡DSP
Flag this post
RL for Reasoning by Adaptively Revealing Rationales
machinelearning.apple.com·1w
⚙️Compilers
Flag this post
Gradient GPS: Turbocharge Your Diffusion Models with Targeted Tuning
dev.to·5d·
Discuss: DEV
⚙️Compilers
Flag this post
A non-diagonal SSM RNN computed in parallel without requiring stabilization
github.com·2w·
Discuss: Hacker News
⚙️Compilers
Flag this post
Automatic network structure discovery of physics informed neural networks via knowledge distillation
nature.com·1w
📡DSP
Flag this post
Hypernetworks: Neural Networks for Hierarchical Data
blog.sturdystatistics.com·3w·
Discuss: Hacker News
SIMD
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.com·3d·
⚙️Compilers
Flag this post
Shape-Shifting AI: Making Models That Adapt to Data
dev.to·4d·
Discuss: DEV
🎨Computer Graphics
Flag this post
krnel-graph: The scikit-learn of AI internals
krnel.ai·2w
⚙️Compilers
Flag this post