Speech Recognition, Music Information Retrieval, Acoustic Modeling, Sound Classification
Make Silence Speak for Itself: a multi-modal learning analytic approach with neurophysiological data
arxiv.org·23h
Chain-of-Cooking: Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance
arxiv.org·23h
STARN-GAT: A Multi-Modal Spatio-Temporal Graph Attention Network for Accident Severity Prediction
arxiv.org·1d
State evolution beyond first-order methods I: Rigorous predictions and finite-sample guarantees
arxiv.org·1d
DRL-AdaPart: DRL-Driven Adaptive STAR-RIS Partitioning for Fair and Frugal Resource Utilization
arxiv.org·1d