A free/open-source hybrid morphological disambiguation tool for Kazakh

Assylbekov, Zhenisbek; Washington, Jonathan; Tyers, Francis; Nurkas, Assulan; Sundetova, Aida; Karibayeva, Aidana; Abduali, Balzhan; Amirova, Dina

A free/open-source hybrid morphological disambiguation tool for Kazakh

Files

kaz-tagger.pdf (1.03 MB)

Date

2016-04

Authors

Assylbekov, Zhenisbek

Washington, Jonathan

Tyers, Francis

Nurkas, Assulan

Sundetova, Aida

Karibayeva, Aidana

Abduali, Balzhan

Amirova, Dina

Publisher

DOI: 10.13140/RG.2.2.12467.43045

Abstract

This paper presents the results of developing a morphological disambiguation tool for Kazakh. Starting with a previously developed rule-based approach, we tried to cope with the complex morphology of Kazakh by breaking up lexical forms across their derivational boundaries into inflectional groups and modeling their behavior with statistical methods. A hybrid rule-based/statistical approach appears to benefit morphological disambiguation demonstrating a per-token accuracy of 91% in running text.

Keywords

open-source, morphological disambiguation, hybrid morphological disambiguation tool, tool for Kazakh, Research Subject Categories::MATHEMATICS

Citation

Assylbekov, Zhenisbek; North, Jonathan; Tyers, Francis; Nurkas, Assulan; Sundetova, Aida; Karibayeva, Aidana; Abduali, Balzhan; Amirova, Dina. (2016). A free/open-source hybrid morphological disambiguation tool for Kazakh.; Conference Paper.

URI

http://nur.nu.edu.kz/handle/123456789/1692

Collections

Conference papers

Full item page

A free/open-source hybrid morphological disambiguation tool for Kazakh

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections