MULTIMODAL MACHINE LEARNING FOR EMOTION RECOGNITION
| dc.contributor.author | Kazikhan, Margulan | |
| dc.date.accessioned | 2025-06-03T07:33:34Z | |
| dc.date.available | 2025-06-03T07:33:34Z | |
| dc.date.issued | 2025-05-08 | |
| dc.description.abstract | Emotion recognition has become a popular research area in recent years because of its many useful applications. The technology has been used in a variety of areas, including social media, crowd monitoring, live streaming, and human-robot interaction. Recent approaches to emotion recognition have used multimodal classification and neural networks such as transformers, LSTMs, and convolutional neural networks. This research has been facilitated by publicly available datasets of videos of people, each labeled with the dominant emotion of the given scene. In this work, a multimodal technique is used to classify scenes from such videos by emotional expression, using extracted video frames, audio, and transcribed text. We investigated ways to improve performance and efficiency at each stage of the classification process, focusing on developing and refining the preprocessing stages for each type of input data. This approach achieved 89% accuracy on a commonly used dataset, using a combination of video, audio, and text. | |
| dc.identifier.citation | Kazikhan, M. (2025). Multimodal Machine Learning for Emotion Recognition. Nazarbayev University School of Engineering and Digital Sciences | |
| dc.identifier.uri | https://nur.nu.edu.kz/handle/123456789/8714 | |
| dc.language.iso | en | |
| dc.publisher | Nazarbayev University School of Engineering and Digital Sciences | |
| dc.rights | Attribution-NonCommercial-ShareAlike 3.0 United States | en |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | |
| dc.subject | type of access: open access | |
| dc.subject | Emotion Recognition | |
| dc.subject | Multimodal Learning | |
| dc.subject | Deep Learning | |
| dc.subject | Image Processing | |
| dc.subject | Intention Estimation | |
| dc.title | MULTIMODAL MACHINE LEARNING FOR EMOTION RECOGNITION | |
| dc.type | Master's thesis | |
Files
Original bundle
- Name: MultimodalMachineLearningforEmotionRecognition.pdf
- Size: 6.12 MB
- Format: Adobe Portable Document Format
- Description: Master's Thesis