DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera
arxiv.orgยท2d
Multimodal AI Systems for Enhanced Laying Hen Welfare Assessment and Productivity Optimization
arxiv.orgยท1d
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success
arxiv.orgยท6d
Topos Causal Models
arxiv.orgยท14h
Touch Speaks, Sound Feels: A Multimodal Approach to Affective and Social Touch from Robots to Humans
arxiv.orgยท1d
"Pull or Not to Pull?'': Investigating Moral Biases in Leading Large Language Models Across Ethical Dilemmas
arxiv.orgยท1d
MeteorPred: A Meteorological Multimodal Large Model and Dataset for Severe Weather Event Prediction
arxiv.orgยท1d
Boosting Visual Knowledge-Intensive Training for LVLMs Through Causality-Driven Visual Object Completion
arxiv.orgยท6d
Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using Deep Deterministic Policy Gradient
arxiv.orgยท2d
SLIP: Soft Label Mechanism and Key-Extraction-Guided CoT-based Defense Against Instruction Backdoor in APIs
arxiv.orgยท2d
Loading...Loading more...