Actor Model, Fault Tolerance, Hot Swapping, Telecommunications
MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments
arxiv.org·4d
Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning
arxiv.org·9h
Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents
arxiv.org·4d
Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents
arxiv.org·9h
Loading...Loading more...