Regarding the impact of Kazakh phonetic transcription on the performance of automatic speech recognition systems

dc.contributor.authorKarabalayeva, Muslima
dc.contributor.authorYessenbayev, Zhandos
dc.contributor.authorKozhirbayev, Zhanibek
dc.date.accessioned2018-05-28T09:22:59Z
dc.date.available2018-05-28T09:22:59Z
dc.date.issued2017-10-21
dc.description.abstractOver the past decades automatic speech recognition has made remarkable advances, in both theoretical and practical aspects. Evolution of research in this field has been proceeding from the recognition of individual sounds and phonemes to the recognition of continuous and mixed speech, including tasks of automatic transcription of broadcast news and telephone conversations. Despite the high performance of continuous speech recognition systems, which makes up to 95%, the performance of phoneme recognition systems remains below 85%. However, phoneme recognition is widely used in a number of applications, such as spoken term detection, language identification, speaker identification and others. The paper presents the results of the experiments on continuous Kazakh speech recognition using different phoneme sets and alternative phonetic transcriptions. This study was instigated by the fact that in modern Kazakh linguistics there is no common agreement about the phonetic system of the Kazakh language, while the list of phonemes and their number noticeably vary in different textbooks. Therefore, we aimed our experiments to study the impact of the phonetic system of the language, its orthoepic rules and the corresponding phonetic transcriptions on the performance of the phoneme recognition systems, which are the initial stage in the general systems of continuous speech recognition. The following 6 systems of phonetic transcription have been considered and tested in our study. The fi rst one is a project of the new Kazakh alphabet and a set of spelling rules proposed by Prof. A. Sharipbay. The second system is a set of orthoepic rules for the actual Kazakh Cyrillic alphabet, introduced by Kazakh linguists – the authors of the Kazakh “Orthoepical Dictionary”. The third one of the systems considered is a phonetic system and a set of empirical transcription rules used by the authors of this work in their studies...en_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/3204
dc.language.isoenen_US
dc.publisherNazarbayev University, National Laboratory Astana.en_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjectautomatic speech recognitionen_US
dc.subjectphoneme recognitionen_US
dc.subjectphonetic transcriptionen_US
dc.subjectKazakh speechen_US
dc.titleRegarding the impact of Kazakh phonetic transcription on the performance of automatic speech recognition systemsen_US
dc.typeArticleen_US
workflow.import.sourcescience

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
content.pdf
Size:
4.62 MB
Format:
Adobe Portable Document Format
Description:

Collections