Gradient Descent, Convex Optimization, Stochastic Methods, Loss Functions

Restaurant Shift Scheduling via Linear Optimization and Staff Constraints
news.ycombinator.com·3h·
🗣️Large Language Models
Spiral Development for Hardware Programs
asbuilt.pub·6h·
🌐Distributed Systems
Olmo 3: America’s truly open reasoning models
interconnects.ai·15h·
🗣️Large Language Models
Distribution Matching Distillation Meets Reinforcement Learning
arxiv.org·3d
🎮Reinforcement Learning
A CUR Krylov Solver for Large-Scale Linear Matrix Equations
arxiv.org·2d
📐Linear Algebra
Teaching signal synchronization in deep neural networks with prospective neurons
arxiv.org·1d
⏱️Time Series Analysis
Graded strength of comparative illusions is explained by Bayesian inference
arxiv.org·2d
🎲Probability Theory
Making Evidence Actionable in Adaptive Learning
arxiv.org·2d
🔶TensorFlow
A Bayesian Model for Multi-stage Censoring
arxiv.org·3d
📈Statistical Learning
Differentiable Sparse Identification of Lagrangian Dynamics
arxiv.org·4d
Automatic Differentiation
Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation
arxiv.org·1d
📡Information Theory
Extended Physics Informed Neural Network for Hyperbolic Two-Phase Flow in Porous Media
arxiv.org·2d
Automatic Differentiation
H-CNN-ViT: A Hierarchical Gated Attention Multi-Branch Model for Bladder Cancer Recurrence Prediction
arxiv.org·2d
🔥PyTorch