Towards Robust Evaluation of Visual Activity Recognition: Resolving Verb Ambiguity with Sense Clustering
arxiv.org·3d
Heterogeneous optimized Schwarz Methods for heat conduction in composites with thermal contact resistance
arxiv.org·16h
Modeling Annotator Disagreement with Demographic-Aware Experts and Synthetic Perspectives
arxiv.org·5d
Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR
arxiv.org·6d
LECTOR: LLM-Enhanced Concept-based Test-Oriented Repetition for Adaptive Spaced Learning
arxiv.org·5d
Towards MR-Based Trochleoplasty Planning
arxiv.org·16h
Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning
arxiv.org·4d
Boosting Visual Knowledge-Intensive Training for LVLMs Through Causality-Driven Visual Object Completion
arxiv.org·4d
Loading...Loading more...