Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach
arxiv.orgยท16h
๐ฏReinforcement Learning
Flag this post
Deep Learning Without Training
๐ฅPyTorch
Flag this post
Taming Chaos: Predicting Unpredictable Systems Without Guesswork by Arvind Sundararajan
๐Dynamic Programming
Flag this post
Accelerating MySQL Query Optimization via Reinforcement Learning & Hypergraph Analysis
๐Query Optimization
Flag this post
Normalized tensor train decomposition
arxiv.orgยท16h
๐งฎEmbeddings
Flag this post
Waterfall Methodology AI: The Smart Evolution of Traditional Project Management
๐ฌPrompt Engineering
Flag this post
Why Nonparametric Models Deserve a Second Look
towardsdatascience.comยท2d
๐ง Machine Learning
Flag this post
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 TechReport
๐ฑEdge AI
Flag this post
Alleviating Hyperparameter-Tuning Burden in SVM Classifiers for Pulmonary Nodules Diagnosis with Multi-Task Bayesian Optimization
arxiv.orgยท1d
๐ง Machine Learning
Flag this post
Predictive Maintenance of Typhoon HIL Simulator Components via Sensor Fusion and Bayesian Optimization
๐ง Machine Learning
Flag this post
Energy Loss Functions for Physical Systems
arxiv.orgยท2d
๐Linear Algebra
Flag this post
Online Learning to Rank under Corruption: A Robust Cascading Bandits Approach
arxiv.orgยท1d
๐ธ๏ธGraph Theory
Flag this post
Topographical sparse mapping: A training framework for deep learning models
๐๏ธComputer Vision
Flag this post
Non-Asymptotic Optimization and Generalization Bounds for Stochastic Gauss-Newton in Overparameterized Models
arxiv.orgยท16h
๐ฌDeep Learning
Flag this post
Deep Koopman Economic Model Predictive Control of a Pasteurisation Unit
arxiv.orgยท16h
๐Dynamic Programming
Flag this post
An introduction to program synthesis (Part II) - Automatically generating features for machine learning
๐ญProgram Synthesis
Flag this post
Matrix Sensing with Kernel Optimal Loss: Robustness and Optimization Landscape
arxiv.orgยท2d
๐ขNumPy
Flag this post
On the relationship between MESP and 0/1 D-Opt and their upper bounds
arxiv.orgยท16h
๐Dynamic Programming
Flag this post
Loading...Loading more...