Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration

dc.contributor.authorRustem Yeshpanov
dc.contributor.authorSaida Mussakhojayeva
dc.contributor.authorYerbolat Khassanov
dc.date.accessioned2025-08-22T10:14:40Z
dc.date.available2025-08-22T10:14:40Z
dc.date.issued2023-08-14
dc.description.abstractThis work aims to build a multilingual text-to-speech (TTS) synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek. We specifically target the zero-shot learning scenario, where a TTS model trained using the data of one language is applied to synthesise speech for other, unseen languages. An end-to-end TTS system based on the Tacotron 2 architecture was trained using only the available data of the Kazakh language. To generate speech for the other Turkic languages, we first mapped the letters of the Turkic alphabets onto the symbols of the International Phonetic Alphabet (IPA), which were then converted to the Kazakh alphabet letters. To demon strate the feasibility of the proposed approach, we evaluated the multilingual Turkic TTS model subjectively and obtained promising results. To enable replication of the experiments, we make our code and dataset publicly available in our GitHub repository.
dc.identifier.citationYeshpanov Rustem, Mussakhojayeva Saida, Khassanov Yerbolat. (2023). Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration. INTERSPEECH 2023. https://doi.org/https://doi.org/10.21437/interspeech.2023-249en
dc.identifier.doi10.21437/interspeech.2023-249
dc.identifier.urihttps://doi.org/10.21437/interspeech.2023-249
dc.identifier.urihttps://nur.nu.edu.kz/handle/123456789/9886
dc.language.isoen
dc.publisherISCA
dc.relation.ispartofINTERSPEECH 2023en
dc.sourceINTERSPEECH 2023, (2023)en
dc.subjectTransliterationen
dc.subjectComputer scienceen
dc.subjectNatural language processingen
dc.subjectSpeech synthesisen
dc.subjectArtificial intelligenceen
dc.subjectSpeech recognitionen
dc.subjecttype of access: open accessen
dc.titleMultilingual Text-to-Speech Synthesis for Turkic Languages Using Transliterationen
dc.typearticleen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Multilingual_Text-to-Speech_Synthesis_for_Turkic_Languages_Using_Transliteration__3d685be2.pdf
Size:
300.18 KB
Format:
Adobe Portable Document Format

Collections