Gradient Descent, Convex Optimization, Stochastic Methods, Loss Functions

Restaurant Shift Scheduling via Linear Optimization and Staff Constraints
news.ycombinator.com·4h·
🗣️Large Language Models
AI is a new computing paradigm – Karpathy
threadreaderapp.com·2d·
🤖Transformers
CroPS: Improving Dense Retrieval with Cross-Perspective Positive Samples in Short-Video Search
arxiv.org·1d
🗣️Large Language Models
Hierarchical Semantic Learning for Multi-Class Aorta Segmentation
arxiv.org·2d
🧠Deep Learning
Thinking about reasoning models made me less worried about scheming
lesswrong.com·11h
🗣️Large Language Models
Area-Optimal Control Strategies for Heterogeneous Multi-Agent Pursuit
arxiv.org·1d
🎯Optimization Theory
Multi-Horizon Time Series Forecasting of non-parametric CDFs with Deep Lattice Networks
arxiv.org·2d
⏱️Time Series Analysis
ALEX: A Light Editing-knowledge Extractor
arxiv.org·2d
🗣️Large Language Models
Uncertainty-Calibrated Prediction of Randomly-Timed Biomarker Trajectories with Conformal Bands
arxiv.org·2d
📈Statistical Learning
Departures: Distributional Transport for Single-Cell Perturbation Prediction with Neural Schrödinger Bridges
arxiv.org·3d
🔶TensorFlow
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story
arxiv.org·1d
💬Natural Language Processing
Uncover and Unlearn Nuisances: Agnostic Fully Test-Time Adaptation
arxiv.org·3d
🗣️Large Language Models
SCI: An Equilibrium for Signal Intelligence
arxiv.org·3d
🤖AI
Oxytrees: Model Trees for Bipartite Learning
arxiv.org·3d
🗣️Large Language Models
Physics-Informed Neural Network-based Reliability Analysis of Buried Pipelines
arxiv.org·3d
🧠Neural Networks
Evolutionary Retrofitting
arxiv.org·4d
🗣️Large Language Models