BLIND SOURCE SEPARATION FOR AUTOMATIC MUSIC TRANSCRIPTION

dc.contributor.authorKurmangaliyev, Bauyrzhan
dc.date.accessioned2024-06-18T06:57:19Z
dc.date.available2024-06-18T06:57:19Z
dc.date.issued2024-05-01
dc.description.abstractThe primary objective of this project is to develop methods aimed to the conduct the blind signal separation of musical notes with Nonnegative Matrix Factorization (NMF). This is motivated by the fact that music signals are often recorded with a single microphone, hence, there is a need to develop the Automatic Music Transcription (AMT) methods that could mitigate this assumption and produce the desirable separation result. Therefore, this project report presents the rank estimation method for determination of number of musical notes in the recording. It is motivated by the fact that most of the research works on NMF assume \emph{a priori} knowledge regarding the rank of factorization which may not be available in most of the real world scenarios. As a result, the Weighted Singular Value Thresholding based on Stein's Unbiased Risk Estimate (WSVT-SURE) in which rank estimation is performed by non-uniform shrinkage of singular values via weight vector is presented. We also introduce gradient optimization of a smooth approximation of WSVT-SURE (GWSVT-SURE) to estimate the optimal threshold parameter. In the context of AMT, the proposed algorithms allow one to estimate the number of musical note components in the recordings. The proposed algorithms have been evaluated with the polyphonic piano music excerpts. It is observed that the proposed WSVT-SURE algorithm reaches significant improvement in the estimation performance, while GWSVT-SURE shows substantial savings in the computational cost.en_US
dc.identifier.citationKurmangaliyev, B. (2024). Blind Source Separation for Automatic Music Transcription. Nazarbayev University School of Engineering and Digital Sciencesen_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/7880
dc.language.isoenen_US
dc.publisherNazarbayev University School of Engineering and Digital Sciencesen_US
dc.subjectType of access: Embargoen_US
dc.subjectAutomatic Music Transcriptionen_US
dc.subjectGradient Optimizationen_US
dc.subjectLatent Dimensionality Estimationen_US
dc.subjectNonnegative Matrix Factorizationen_US
dc.subjectSingular Value Thresholdingen_US
dc.titleBLIND SOURCE SEPARATION FOR AUTOMATIC MUSIC TRANSCRIPTIONen_US
dc.typeBachelor's thesisen_US
workflow.import.sourcescience

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
main.pdf
Size:
3.46 MB
Format:
Adobe Portable Document Format
Description:
capstone project
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.28 KB
Format:
Item-specific license agreed upon to submission
Description: