SampCert: Verified Foundations for Differential Privacy (PLDI 2025)
📊linear programming
Online Learning to Rank under Corruption: A Robust Cascading Bandits Approach
arxiv.org·13h
🧩operations research
Dynamic Gradient Echo Pulse Sequence Optimization via Reinforcement Learning for Reduced Artifact in 3T MRI
🧩operations research
Unlocking AI Speed: The Hidden Symmetries in Reinforcement Learning
📊linear programming
Climate Adaptation with Reinforcement Learning: Economic vs. Quality of Life Adaptation Pathways
arxiv.org·13h
🧩operations research
Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph
📊linear programming
Learning Complementary Policies for Human-AI Teams
arxiv.org·2d
🧩operations research
Expected Value Analysis in AI Product Management
towardsdatascience.com·2h
🧩operations research
How I Leverage LLMs
📊linear programming
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
arxiv.org·2d
📊linear programming
Algorithmic Trust Calibration via Adversarial Multi-Agent Simulations
🧩operations research
Beyond Standard LLMs
📊linear programming
Auditable-choice reframing unlocks RL-based verification for open-ended tasks
arxiv.org·1d
🧩operations research
Comparative Analysis of Discrete and Continuous Action Spaces in Reservoir Management and Inventory Control Problems
arxiv.org·1d
🧩operations research
Real-Time Process Optimization via Adaptive Bayesian Reinforcement Learning and Multi-Objective Genetic Algorithms
📊linear programming
Neural Green's Functions
arxiv.org·1d
📊linear programming