Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
arxiv.org·2d
RemVerse: Supporting Reminiscence Activities for Older Adults through AI-Assisted Virtual Reality
arxiv.org·2d
Partial decidability protocol for the Wang tiling problem from statistical mechanics and chaotic mapping
arxiv.org·2d
Neural Co-state Regulator: A Data-Driven Paradigm for Real-time Optimal Control with Input Constraints
arxiv.org·3d
The Generalist Brain Module: Module Repetition in Neural Networks in Light of the Minicolumn Hypothesis
arxiv.org·2d
Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning
arxiv.org·3d
Monotone weak distributive laws over the lifted powerset monad in categories of algebras
arxiv.org·2d
TrialCompass: Visual Analytics for Enhancing the Eligibility Criteria Design of Clinical Trials
arxiv.org·3d
Unveiling the Visual Rhetoric of Persuasive Cartography: A Case Study of the Design of Octopus Maps
arxiv.org·3d
Loading...Loading more...