Long-Tail Theory Under Gaussian Mixtures

Zhenisbek Assylbekov; Vassilina Nikoulina; Artur Pak; Igor Melnykov; Maxat Tezekbayev; Arman Bolatov

doi:10.3233/FAIA230260

dc.contributor.author	Zhenisbek Assylbekov
dc.contributor.author	Vassilina Nikoulina
dc.contributor.author	Artur Pak
dc.contributor.author	Igor Melnykov
dc.contributor.author	Maxat Tezekbayev
dc.contributor.author	Arman Bolatov
dc.date.accessioned	2025
dc.date.issued	2023
dc.description.abstract	We suggest a simple Gaussian mixture model for data generation that complies with Feldman’s long tail theory (2020). We demonstrate that a linear classifier cannot decrease the generalization error below a certain level in the proposed model, whereas a nonlinear classifier with a memorization capacity can. This confirms that for long-tailed distributions, rare training examples must be considered for optimal generalization to new data. Finally, we show that the performance gap between linear and nonlinear models can be lessened as the tail becomes shorter in the subpopulation frequency distribution, as confirmed by experiments on synthetic and real data.
dc.identifier.citation	Arman Bolatov, Maxat Tezekbayev, Igor Melnykov, Artur Pak, Vassilina Nikoulina, & Zhenisbek Assylbekov (2023). Long-Tail Theory Under Gaussian Mixtures. Frontiers in Artificial Intelligence and Applications. https://doi.org/10.3233/FAIA230260
dc.identifier.doi	10.3233/FAIA230260
dc.identifier.uri	https://doi.org/10.3233/FAIA230260
dc.identifier.uri	https://nur.nu.edu.kz/handle/123456789/17460
dc.language	en
dc.publisher	Frontiers in Artificial Intelligence and Applications
dc.rights	All rights reserved
dc.source	Frontiers in Artificial Intelligence and Applications
dc.subject	Unsupervised learning
dc.subject	Quantum mechanics
dc.subject	Mathematical analysis
dc.subject	Physics
dc.subject	Pattern recognition (psychology)
dc.subject	Statistical physics
dc.subject	Applied mathematics
dc.subject	Algorithm
dc.subject	Computer science
dc.subject	Training set
dc.subject	Mathematics
dc.subject	Artificial intelligence
dc.subject	Generalization error
dc.subject	Nonlinear system
dc.subject	Generalization
dc.subject	Classifier (UML)
dc.subject	Gaussian
dc.title	Long-Tail Theory Under Gaussian Mixtures
dc.type	Article

Collections

Books & Book Chapters