Learning General Policies with Policy Gradient Methods
arxiv.org·4d
🎯Reinforcement Learning
Preview
Report Post

Title:Learning General Policies with Policy Gradient Methods

View PDF HTML (experimental)

Abstract:While reinforcement learning methods have delivered remarkable results in a number of settings, generalization, i.e., the ability to produce policies that generalize in a reliable and systematic way, has remained a challenge. The problem of generalization has been addressed formally in classical planning where provable correct policies that generalize over all instances of a given domain have been learned using combinatorial methods. The aim of this work is to bring these two research threads together to illuminate the conditions under which (deep) reinforcement learning approaches, and in particular, po…

Similar Posts

Loading similar posts...