BLIND SOURCE SEPARATION FOR AUTOMATIC MUSIC TRANSCRIPTION
dc.contributor.author | Kurmangaliyev, Bauyrzhan | |
dc.date.accessioned | 2024-06-18T06:57:19Z | |
dc.date.available | 2024-06-18T06:57:19Z | |
dc.date.issued | 2024-05-01 | |
dc.description.abstract | The primary objective of this project is to develop methods aimed to the conduct the blind signal separation of musical notes with Nonnegative Matrix Factorization (NMF). This is motivated by the fact that music signals are often recorded with a single microphone, hence, there is a need to develop the Automatic Music Transcription (AMT) methods that could mitigate this assumption and produce the desirable separation result. Therefore, this project report presents the rank estimation method for determination of number of musical notes in the recording. It is motivated by the fact that most of the research works on NMF assume \emph{a priori} knowledge regarding the rank of factorization which may not be available in most of the real world scenarios. As a result, the Weighted Singular Value Thresholding based on Stein's Unbiased Risk Estimate (WSVT-SURE) in which rank estimation is performed by non-uniform shrinkage of singular values via weight vector is presented. We also introduce gradient optimization of a smooth approximation of WSVT-SURE (GWSVT-SURE) to estimate the optimal threshold parameter. In the context of AMT, the proposed algorithms allow one to estimate the number of musical note components in the recordings. The proposed algorithms have been evaluated with the polyphonic piano music excerpts. It is observed that the proposed WSVT-SURE algorithm reaches significant improvement in the estimation performance, while GWSVT-SURE shows substantial savings in the computational cost. | en_US |
dc.identifier.citation | Kurmangaliyev, B. (2024). Blind Source Separation for Automatic Music Transcription. Nazarbayev University School of Engineering and Digital Sciences | en_US |
dc.identifier.uri | http://nur.nu.edu.kz/handle/123456789/7880 | |
dc.language.iso | en | en_US |
dc.publisher | Nazarbayev University School of Engineering and Digital Sciences | en_US |
dc.subject | Type of access: Embargo | en_US |
dc.subject | Automatic Music Transcription | en_US |
dc.subject | Gradient Optimization | en_US |
dc.subject | Latent Dimensionality Estimation | en_US |
dc.subject | Nonnegative Matrix Factorization | en_US |
dc.subject | Singular Value Thresholding | en_US |
dc.title | BLIND SOURCE SEPARATION FOR AUTOMATIC MUSIC TRANSCRIPTION | en_US |
dc.type | Bachelor's thesis | en_US |
workflow.import.source | science |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- main.pdf
- Size:
- 3.46 MB
- Format:
- Adobe Portable Document Format
- Description:
- capstone project
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 6.28 KB
- Format:
- Item-specific license agreed upon to submission
- Description: