Meta-Reinforcement Learning via Evolution for Multi-Objective Combinatorial Supply Chain Optimisation (opens in new tab)
Meta-reinforcement learning is a promising approach to multi-objective optimisation because it enables rapid policy adaptation across changing environments and preference settings. However, conventional few-shot methods usually fine-tune from a single shared meta-policy, which can reduce solution diversity and limit exploration of the Pareto front, especially in high-dimensional combinatorial problems such as supply chain optimisation. We propos...
Read the original article