Transcribing Bengali Text with Regional Dialects to IPA using District Guided Tokens (opens in new tab)
arXiv:2403.17407v4 Announce Type: replace-cross Abstract: Accurate transcription of Bengali text to the International Phonetic Alphabet (IPA) is a challenging task due to the complex phonology of the language and context-dependent sound changes. This challenge is even more for regional Bengali dialects due to unavailability of standardized spelling conventions for these dialects, presence of local and foreign words popular in those regions and phonological diversity across different regions. ...
Read the original article