Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba
arxiv.org·3d
Towards a Typology of Strange LLM Chains-of-Thought
lesswrong.com·1d
Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis
arxiv.org·1d
The key to conversational speech recognition
datasciencecentral.com·1d
MuFFIN: Multifaceted Pronunciation Feedback Model with Interactive Hierarchical Neural Modeling
arxiv.org·3d
Everyday AI Agents
oreilly.com·10h
Loading...Loading more...