Neural TTS, Voice Cloning, Real-time Audio, Kitten TTS
The Ultra Scale Playbook vol-2: Data Parallelism
jaisidhsingh.bearblog.dev·5h
MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding
arxiv.org·1d
Sycophancy under Pressure: Evaluating and Mitigating Sycophantic Bias via Adversarial Dialogues in Scientific QA
arxiv.org·23h
CardAIc-Agents: A Multimodal Framework with Hierarchical Adaptation for Cardiac Care Support
arxiv.org·23h
Loading...Loading more...