DSpace Repository

THE REDISCOVERY HYPOTHESIS: LANGUAGE MODELS NEED TO MEET LINGUISTICS

dc.contributor.author Maxat, Tezekbayev
dc.date.accessioned 2022-05-11T07:56:19Z
dc.date.available 2022-05-11T07:56:19Z
dc.date.issued 2022-05
dc.identifier.citation Tezekbayev Maxat (2022). The Rediscovery Hypothesis: Language Models Need to Meet Linguistics. Nazarbayev University, Nur-Sultan, Kazakhstan en_US
dc.identifier.uri http://nur.nu.edu.kz/handle/123456789/6141
dc.description.abstract There is an ongoing debate in the NLP community about whether modern language models contain linguistic knowledge, recovered through so-called probes. This work examines whether linguistic knowledge is a necessary condition for the good performance of modern language models, a claim we call the rediscovery hypothesis. First, we show that language models that are significantly compressed but still perform well on their pretraining objectives retain good scores when probed for linguistic structures. This result supports the rediscovery hypothesis and leads to an information-theoretic framework that relates language modeling objectives to linguistic information. This framework also provides a metric to measure the impact of linguistic information on the word prediction task. We reinforce our analytical results with various experiments, both on synthetic and on real NLP tasks in English. en_US
dc.language.iso en en_US
dc.publisher Nazarbayev University School of Sciences and Humanities en_US
dc.rights Attribution-NonCommercial-ShareAlike 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/us/ *
dc.subject Type of access: Open Access en_US
dc.subject Linguistics en_US
dc.subject Language Models en_US
dc.title THE REDISCOVERY HYPOTHESIS: LANGUAGE MODELS NEED TO MEET LINGUISTICS en_US
dc.type Master's thesis en_US
workflow.import.source science


Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 3.0 United States
