EVOM: Agentic Meta-Evolution of Actor-Critic Architectures for Reinforcement Learning (opens in new tab)

In actor-critic reinforcement learning, network architectures are typically manually designed. Automating this design is challenging because each candidate must be trained before evaluation, and the design space is open-ended. To address these challenges, we introduce EVOM, an agentic meta-evolution framework for discovering high-performance actor-critic architectures. We frame architecture search as a bi-level optimization: an inner loop trai...

Read the original article