Distinguishing Target and Non-Target Fixations with EEG and Eye Tracking in Realistic Visual Scenes
arxiv.orgยท1h
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
arxiv.orgยท5d
SAMPO: Visual Preference Optimization for Intent-Aware Segmentation with Vision Foundation Models
arxiv.orgยท1h
MemTool: Optimizing Short-Term Memory Management for Dynamic Tool Calling in LLM Agent Multi-Turn Conversations
arxiv.orgยท6d
Accessibility and Social Inclusivity: A Literature Review of Music Technology for Blind and Low Vision People
arxiv.orgยท1h
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance
arxiv.orgยท5d
HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents
arxiv.orgยท1h
Loading...Loading more...