⛰️ Gradient Descent - pfh · Scour

Natural Hypergradient Descent: Algorithm Design, Convergence Analysis, and Parallel Implementation

arxiv.org·12h

🎪Convex Optimization

Don't give away to the gradient descent

carteakey.dev·17h·

Discuss: Hacker News

📊Empirical Bayes

Gradient Residual Connections

arxiv.org·1d

🎪Convex Optimization

Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training

chipublib.idm.oclc.org·1d

Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation

chizkidd.github.io·1d·

Discuss: Hacker News

🗺️Manifold Learning

Gibbs Measures from Deep Shaped Multilayer Perceptrons

link.aps.org·5h

📊Empirical Bayes

gist.github.com·20h·

Discuss: Hacker News, Hacker News

A training principle for drifting models

breno.bearblog.dev·6h

🎪Convex Optimization

Learning Optimization Tools

trendhunter.com·2d

🎪Convex Optimization

Grassmannian Manifold Learning: Optimization and Deep Learning Architectures

hackernoon.com·1d

🗺️Manifold Learning

Active learning Kriging with functional dimension reduction for reliability analysis of stochastic dynamical systems

sciencedirect.com·1h

🔗Markov Chains

The 4 Mixture of Experts Architectures: How to Train 100B Models at 10B Cost

pub.towardsai.net

·4h

Building a Robust Classifier with Stacked Generalization

dev.to·2d·

Discuss: DEV

UbiquitousLearning/mllm: Fast Multimodal LLM on Mobile Devices

github.com·8h

🦠Whole cell model

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling

huggingface.co·1h·

Discuss: Hacker News

How Andrej Karpathy Built a Working Transformer in 243 Lines of Code

analyticsvidhya.com·4h

EyesOff: Why Some Models Quantize Better Than Others

ym2132.github.io·18h·

Discuss: Hacker News

New Generative Paradigm: Drifting Model

mail.bycloud.ai·1d

📊Empirical Bayes

Hybrid meta-optimized GNN network to optimize pitch angle and active power of wind turbines for reducing fatigue load

sciencedirect.com·1d

🎪Convex Optimization

Wahba’s Problem and SO(3) Optimization: Rotation Learning in Geometric ML

hackernoon.com·1d

🎪Convex Optimization

Loading more...