Authors: Makhambetov, O.; Makazhanov, A.; Yessenbayev, Z.; Matkarimov, B.; Sabyrgaliyev, I.; Sharafudinov, A.
Deposited: 2015-11-04
Issued: 2013
URL: http://nur.nu.edu.kz/handle/123456789/748
Abstract: A language corpus is a collection of texts written in that language and classified by genre. Corpora are actively used by researchers from different fields (most notably linguists and computer scientists) and by industry (Google, Yandex, etc.).
Language: en
Keywords: first research week; internet data accumulation
Title: Internet data accumulation and processing complex
Type: Abstract