🎮 Reinforcement Learning - ashiqabdulkhader · Scour

Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation 🧠AI Agents

SpecRLBench: A Benchmark for Generalization in Specification-Guided Reinforcement Learning 🧠LLMs

Lifting Embodied World Models for Planning and Control 🧠AI Agents

From Coarse to Fine: Self-Adaptive Hierarchical Planning for LLM Agents 🧠AI Agents

Split over $n$ resource sharing problem: Are fewer capable agents better than many simpler ones? 🕸️Distributed Systems

CoFi-PGMA: Counterfactual Policy Gradients under Filtered Feedback for Multi-Agent LLMs 🧠LLMs

reward-lens: A Mechanistic Interpretability Library for Reward Models 🧠LLMs

Frictive Policy Optimization for LLMs: Epistemic Intervention, Risk-Sensitive Control, and Reflective Alignment 🧠LLMs

NeuroPlastic: A Plasticity-Modulated Optimizer for Biologically Inspired Learning Dynamics 🧠LLMs

Perfecting Aircraft Maneuvers with Reinforcement Learning 🚗Autonomous Systems

Co-Learning Port-Hamiltonian Systems and Optimal Energy-Shaping Control 🚗Autonomous Systems

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents 🧠LLMs

3D Generation for Embodied AI and Robotic Simulation: A Survey 🤖Robotics

SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning 🧠AI Agents

BitRL: Reinforcement Learning with 1-bit Quantized Language Models for Resource-Constrained Edge Deployment ⚙️MLOps

CODA: Coordination via On-Policy Diffusion for Multi-Agent Offline Reinforcement Learning 🕸️Distributed Systems

AEL: Agent Evolving Learning for Open-Ended Environments 🧠AI Agents

CAPSULE: Control-Theoretic Action Perturbations for Safe Uncertainty-Aware Reinforcement Learning 🚗Autonomous Systems

Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own 🧠AI Agents

Agent-Centric Visual Reinforcement Learning under Dynamic Perturbations 🧠AI Agents
