🎮 Reinforcement Learning - jyunzhang · Scour

Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains

🤖Machine Learning Academic

Exact Unlearning in Reinforcement Learning

🤖LLMs Academic

Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment

📈Optimization Academic

Drag reduction or reward hacking? Recurrent multi-agent reinforcement learning that earns its reward

🔲Cellular Automata Academic

Selective-Advantage Entropy-Adaptive Horizon GRPO: Asymmetric Token-Level Discounting for Efficient Reinforcement Learning of Language Models

📈Optimization Academic

Smart Transportation Without Neurons -- Fair Metro Network Expansion with Tabular Reinforcement Learning

🎭Anthropic Claude Academic

Explainably Safe Reinforcement Learning

💬Prompt Engineering Academic

From Ticks to Flows: Dynamics of Neural Reinforcement Learning in Continuous Environments

🧠Deep Learning Academic

GARL: Game-Theoretic Reinforcement Learning for Multi-Agent Strategic Prioritisation

⚙️Concurrency Models Academic

Self-Optimizing Control of Continuous Processes Based on Reinforcement Learning

📈Optimization Academic

Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies

🤖AI Academic

RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

🔐Cryptography Academic

Reinforcement Learning from Rich Feedback with Distributional DAgger

📈Optimization Academic

OrderGrad: Optimizing Beyond the Mean with Order-Statistic Policy Gradient Estimation

📈Optimization Academic

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

🔲Cellular Automata Academic

AgentJet: A Flexible Swarm Training Framework for Agentic Reinforcement Learning

💬Prompt Engineering Academic
