Abstract:
A language corpus is a collection of texts written in that language and classified by genre. Corpora are widely used by researchers in different fields (most notably linguists and computer scientists) and by industry (Google, Yandex, etc.).