DSpace Repository

Spoken term detection for Kazakh language

Show simple item record

dc.contributor.author Kozhirbayev, Zhanibek
dc.contributor.author Karabalayeva, Muslima
dc.contributor.author Yessenbayev, Zhandos
dc.date.accessioned 2018-05-29T05:13:14Z
dc.date.available 2018-05-29T05:13:14Z
dc.date.issued 2016-08-21
dc.identifier.uri http://nur.nu.edu.kz/handle/123456789/3211
dc.description.abstract The paper presents a spoken term detection system for Kazakh language in which significant improvements are obtained through modifying speech-to-text process used for generating word- based lattices. These lattices are indexed and used for the keyword search later. Spoken Term Detection systems quickly discover the occurrence of a term, which might be just a word or sequence of words, in a large audio set of heterogeneous speech records. The paper provides an overview of a speech-to-text and keyword search system architecture built primarily on the top of the Kaldi toolkit and expands on a few highlights. Our aim was to develop a general system pipeline which could be advanced regarding phonological and linguistic features of Kazakh language in order to detect OOV keywords. en_US
dc.language.iso en en_US
dc.publisher The 4-th International Conference on Computer Processing of Turkic Languages en_US
dc.rights Attribution-NonCommercial-NoDerivs 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.subject Speech Retrieval, Lattice Indexing, Spoken Term Detection, Speech Recognition, Keyword Search en_US
dc.title Spoken term detection for Kazakh language en_US
dc.title.alternative ПОИСК РАЗГОВОРНОГО ТЕРМИНА НА КАЗАХСКОМ ЯЗЫКЕ en_US
dc.type Conference Paper en_US
workflow.import.source science


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States