Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba
arxiv.org·4d
Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis
arxiv.org·2d
Towards a Typology of Strange LLM Chains-of-Thought
lesswrong.com·1d
The key to conversational speech recognition
datasciencecentral.com·1d
MuFFIN: Multifaceted Pronunciation Feedback Model with Interactive Hierarchical Neural Modeling
arxiv.org·4d
Loading...Loading more...