Speech Recognition, Music Information Retrieval, Acoustic Modeling, Sound Classification
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models
arxiv.org·1d
Expanding Relevance Judgments for Medical Case-based Retrieval Task with Multimodal LLMs
arxiv.org·1d
RecLLM-R1: A Two-Stage Training Paradigm with Reinforcement Learning and Chain-of-Thought v1
arxiv.org·20h
Episode-specific Fine-tuning for Metric-based Few-shot Learners with Optimization-based Training
arxiv.org·1d
ECG-SMART-NET: A Deep Learning Architecture for Precise ECG Diagnosis of Occlusion Myocardial Infarction
arxiv.org·20h
GLIMPSE: Gradient-Layer Importance Mapping for Prompted Visual Saliency Explanation for Generative LVLMs
arxiv.org·20h
Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages
arxiv.org·1d
Safe Pruning LoRA: Robust Distance-Guided Pruning for Safety Alignment in Adaptation of LLMs
arxiv.org·20h
SUTRA: Decoupling Concept & Language for Multilingual LLM Excellence
hackernoon.com·8h