Long-Tail Theory Under Gaussian Mixtures
| dc.contributor.author | Zhenisbek Assylbekov | |
| dc.contributor.author | Vassilina Nikoulina | |
| dc.contributor.author | Artur Pak | |
| dc.contributor.author | Igor Melnykov | |
| dc.contributor.author | Maxat Tezekbayev | |
| dc.contributor.author | Arman Bolatov | |
| dc.date.accessioned | 2025 | |
| dc.date.issued | 2023 | |
| dc.description.abstract | We suggest a simple Gaussian mixture model for data generation that complies with Feldman’s long tail theory (2020). We demonstrate that a linear classifier cannot decrease the generalization error below a certain level in the proposed model, whereas a nonlinear classifier with a memorization capacity can. This confirms that for long-tailed distributions, rare training examples must be considered for optimal generalization to new data. Finally, we show that the performance gap between linear and nonlinear models can be lessened as the tail becomes shorter in the subpopulation frequency distribution, as confirmed by experiments on synthetic and real data. | |
| dc.identifier.citation | Arman Bolatov, Maxat Tezekbayev, Igor Melnykov, Artur Pak, Vassilina Nikoulina, & Zhenisbek Assylbekov (2023). Long-Tail Theory Under Gaussian Mixtures. Frontiers in Artificial Intelligence and Applications. https://doi.org/10.3233/FAIA230260 | |
| dc.identifier.doi | 10.3233/FAIA230260 | |
| dc.identifier.uri | https://doi.org/10.3233/FAIA230260 | |
| dc.identifier.uri | https://nur.nu.edu.kz/handle/123456789/17460 | |
| dc.language | en | |
| dc.publisher | Frontiers in Artificial Intelligence and Applications | |
| dc.rights | All rights reserved | |
| dc.source | Frontiers in Artificial Intelligence and Applications | |
| dc.subject | Unsupervised learning | |
| dc.subject | Quantum mechanics | |
| dc.subject | Mathematical analysis | |
| dc.subject | Physics | |
| dc.subject | Pattern recognition (psychology) | |
| dc.subject | Statistical physics | |
| dc.subject | Applied mathematics | |
| dc.subject | Algorithm | |
| dc.subject | Computer science | |
| dc.subject | Training set | |
| dc.subject | Mathematics | |
| dc.subject | Artificial intelligence | |
| dc.subject | Generalization error | |
| dc.subject | Nonlinear system | |
| dc.subject | Generalization | |
| dc.subject | Classifier (UML) | |
| dc.subject | Gaussian | |
| dc.title | Long-Tail Theory Under Gaussian Mixtures | |
| dc.type | Article | |