📈 Optimization - jyunzhang · Scour

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

🧠Deep Learning Academic

Machine learning from scratch, what to build before using scikit-learn

🤖Machine Learning Tutorial

iwtlp.com··DEV

Pytorch for Neural Networks Part 6: Understanding Epochs and Loss

🧠Deep Learning Blog

How LLMs Work?

🤖Transformers

pub.towardsai.net

·

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

🤖Machine Learning Code

github.com··Hacker News

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

📐Linear Algebra Blog

tridao.me··Hacker News

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

🧠Deep Learning

aarushgupta.io··Lobsters, Hacker News

Pytorch for Neural Networks Part 5: Preparing the Model for Training

🧠Deep Learning Blog

Karpathy’s 90-Second Time Machine Through 33 Years of Neural Networks

🧠Deep Learning

pub.towardsai.net

·

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

🤖Machine Learning Academic

markusheimerl/gpt: A generative pretrained transformer implementation

🤖Transformers Code

github.com··Hacker News

Building a Multilayer Perceptron from Scratch: What It Taught Me About Neural Networks

🧠Deep Learning Blog

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

🎮Reinforcement Learning Academic

Beyond Basic RAG (Part 3): Agentic RAG, CRAG, Self-RAG and GraphRAG Explained | M012 | Mehul Ligade

pub.towardsai.net

·

Second-Order Path Kernel Interpolation Formulas in Machine Learning

🧠Deep Learning Academic

Meltedd/scarecrow: An adversarial frame pattern optimizer for evading automated license plate recognition, personalized to your plate.

🎭Anthropic Claude Code

github.com··Hacker News

From Linear Regression to Gradient Descent

🧠Deep Learning Blog

Optimizing Energy-based Neural Network Training with Coherent Ising Machine

🧠Deep Learning Academic

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

🔥PyTorch Academic

Fourier fractal dimension to predict the generalization of deep neural networks

🧠Deep Learning Academic

Log in to enable infinite scrolling