Prefrontal inhibitory mechanisms associated with Putamen activity during valence learning revealed by multimodal fMRI-fMRS
nature.com·4d
🎮Reinforcement Learning
Flag this post
Magentic Marketplace: an open-source simulation environment for studying agentic markets
microsoft.com·1d
🧊Iceberg Tables
Flag this post
On the Fundamental Limitations of Decentralized Learnable Reward Shaping in Cooperative Multi-Agent Reinforcement Learning
arxiv.org·3d
🎮Reinforcement Learning
Flag this post
**Adaptive Algorithmic Profiling & Resource Allocation via Dynamic Markov Chain Optimization**
⚡SIMD Optimization
Flag this post
Fair and Explainable Credit-Scoring under Concept Drift: Adaptive Explanation Frameworks for Evolving Populations
arxiv.org·7h
🎮Reinforcement Learning
Flag this post
Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments
arxiv.org·1d
🎮Reinforcement Learning
Flag this post
Deep Koopman Economic Model Predictive Control of a Pasteurisation Unit
arxiv.org·7h
🤖AI
Flag this post
Multi-Sensor Distributed Hypothesis Testing in the Low-Power Regime
arxiv.org·3d
📊Information Theory
Flag this post
SSPO: Subsentence-level Policy Optimization
arxiv.org·7h
🎮Reinforcement Learning
Flag this post
I Want to Break Free! Persuasion and Anti-Social Behavior of LLMs in Multi-Agent Settings with Social Hierarchy
arxiv.org·2d
🎮Reinforcement Learning
Flag this post
Neural Green's Functions
arxiv.org·2d
🎮Reinforcement Learning
Flag this post
Design-Based Supply Chain Operations Research Model: Fostering Resilience And Sustainability In Modern Supply Chains
arxiv.org·2d
🔧Data Engineering
Flag this post
Novelty and Impact of Economics Papers
arxiv.org·3d
🔬Academic Search
Flag this post
Online Energy Storage Arbitrage under Imperfect Predictions: A Conformal Risk-Aware Approach
arxiv.org·3d
🎮Reinforcement Learning
Flag this post
Deep Value Benchmark: Measuring Whether Models Generalize Deep values or Shallow Preferences
arxiv.org·2d
🎮Reinforcement Learning
Flag this post
Reevaluating Self-Consistency Scaling in Multi-Agent Systems
arxiv.org·3d
🎮Reinforcement Learning
Flag this post
DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration
arxiv.org·7h
🎮Reinforcement Learning
Flag this post
Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry
arxiv.org·4d
🎮Reinforcement Learning
Flag this post
Loading...Loading more...