BENIGN OVERFITTING WITH RETRIEVAL AUGMENTED MODELS
dc.contributor.author | Assylbekov, Zhenisbek | |
dc.contributor.author | Tezekbayev, Maxat | |
dc.contributor.author | Nikoulina, Vassilina | |
dc.contributor.author | Gallé, Matthias | |
dc.date.accessioned | 2022-10-11T08:54:20Z | |
dc.date.available | 2022-10-11T08:54:20Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Despite the fact that modern deep neural networks have the ability to memorize (almost) the entire training set they generalize well to unseen data, contradicting traditional learning theory. This phenomenon --- dubbed benign overfitting --- has been theoretically studied so far in simplified settings only. At the same time, ML practitioners (especially in NLP) figured out how to exploit this feature for more efficient training: retrieval-augmented models (e.g., kNN-LM, RETRO) explicitly store (part of) the training sample in the storage and thus try to (partially) remove a load of memorization from the neural network. In this paper we link these apparently separate threads of research, and propose several possible research directions regarding benign overfitting in retrieval-augmented models. | en_US |
dc.identifier.uri | http://nur.nu.edu.kz/handle/123456789/6735 | |
dc.language.iso | en_US | en_US |
dc.publisher | Nazarbayev University School of Sciences and Humanities | |
dc.rights | CC0 1.0 Universal | * |
dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | * |
dc.subject | Machine Learning, Retrieval Augmentation, Benign Overfitting, Long Tail Theory, Simplicity Bias, Natural Language Processing, Memorization and Generalization, Learning Theory | en_US |
dc.subject | Type of access: Open Access | |
dc.title | BENIGN OVERFITTING WITH RETRIEVAL AUGMENTED MODELS | en_US |
dc.type | Working Paper | en_US |
workflow.import.source | science |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Benign_Overfitting_with_Retrieval_Augmented_Models.pdf
- Size:
- 489.34 KB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 6.28 KB
- Format:
- Item-specific license agreed upon to submission
- Description: