A-LAMP: Agentic LLM-Based Framework for Automated MDP Modeling and Policy Generation
arxiv.org·1d
🧮Prolog Parsing
Preview
Report Post

View PDF

Abstract:Applying reinforcement learning (RL) to real-world tasks requires converting informal descriptions into a formal Markov decision process (MDP), implementing an executable environment, and training a policy agent. Automating this process is challenging due to modeling errors, fragile code, and misaligned objectives, which often impede policy training. We introduce an agentic large language model (LLM)-based framework for automated MDP modeling and policy generation (A-LAMP), that automatically translates free-form natural language task descriptions into an MDP formulation and trained policy. The framework decomposes modeling, coding, and training into verifiable stages, ensuring semantic alignment throughout the pipeline. …

Similar Posts

Loading similar posts...