Stream Processing, Low Latency, Event Processing, Online Learning
Audio-Thinker: Guiding Audio Language Model When and How to Think via Reinforcement Learning
arxiv.org·5d
How Does a Virtual Agent Decide Where to Look? - Symbolic Cognitive Reasoning for Embodied Head Rotation
arxiv.org·4d
A Markov Decision Process Framework for Early Maneuver Decisions in Satellite Collision Avoidance
arxiv.org·6d
Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent
arxiv.org·5d
Robust Reinforcement Learning over Wireless Networks with Homomorphic State Representations
arxiv.org·5d