Meta-Reinforcement Learning via Evolution for Multi-Objective Combinatorial Supply Chain Optimisation (opens in new tab)

Meta-reinforcement learning is a promising approach to multi-objective optimisation because it enables rapid policy adaptation across changing environments and preference settings. However, conventional few-shot methods usually fine-tune from a single shared meta-policy, which can reduce solution diversity and limit exploration of the Pareto front, especially in high-dimensional combinatorial problems such as supply chain optimisation. We propos...

Read the original article