Robust Single-Agent Reinforcement Learning for Regional Traffic Signal Control Under Demand Fluctuations
arxiv.org·1d
📈Optimization
Enhancing Diffusion-based Restoration Models via Difficulty-Adaptive Reinforcement Learning with IQA Reward
arxiv.org·1d
📈Optimization
Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
arxiv.org·1d
📈Optimization
Quadratic Direct Forecast for Training Multi-Step Time-Series Forecast Models
arxiv.org·1d
🔵Eigenvalues
The Next Frontier in NLP: Smarter Agents, Not Just Bigger Models
pub.towardsai.net·1h
λFunctional Programming
The Collaboration Gap
arxiv.org·1h
⏱️Computational Complexity
For Synthetic Situations
lesswrong.com·1d
λFunctional Programming
[D] PhD New Grad Role OA
⏱️Computational Complexity
Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes
arxiv.org·1d
⏱️Computational Complexity
The Math That Makes Zero-Shot Learning Work: A Proof Using Only Addition
pub.towardsai.net·1d
🔢Mathematics
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·1d
📈Optimization
PAINT25 Invited Talk transcript: “Notational Freedom via Self-Raising Diagrams”
programmingmadecomplicated.wordpress.com·17h
🔬Static Analysis
Geometric Data Valuation via Leverage Scores
arxiv.org·1h
🔢Mathematics
Branched Signature Model
arxiv.org·1d
🌳Red-Black Trees