RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
arxiv.orgยท6d
SAMPO: Visual Preference Optimization for Intent-Aware Segmentation with Vision Foundation Models
arxiv.orgยท1d
Estimation of Hemodynamic Parameters via Physics Informed Neural Networks including Hematocrit Dependent Rheology
arxiv.orgยท17h
Improving Q-Learning for Real-World Control: A Case Study in Series Hybrid Agricultural Tractors
arxiv.orgยท17h
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance
arxiv.orgยท6d
HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents
arxiv.orgยท1d
A Comparative Study of Optimal Control and Neural Networks in Asteroid Rendezvous Mission Analysis
arxiv.orgยท17h
MARS: A Meta-Adaptive Reinforcement Learning Framework for Risk-Aware Multi-Agent Portfolio Management
arxiv.orgยท1d
SigBERT: Combining Narrative Medical Reports and Rough Path Signature Theory for Survival Risk Estimation in Oncology
arxiv.orgยท5d
Loading...Loading more...