MULTIMODAL MACHINE LEARNING FOR EMOTION RECOGNITION

dc.contributor.author: Kazikhan, Margulan
dc.date.accessioned: 2025-06-03T07:33:34Z
dc.date.available: 2025-06-03T07:33:34Z
dc.date.issued: 2025-05-08
dc.description.abstract: Emotion recognition has become a popular research area in recent years because of its many practical applications. The technology has been used in social media, crowd monitoring, live streaming, and human-robot interaction. Recent approaches have used neural networks such as transformers, LSTMs, and convolutional neural networks, as well as multimodal classification. This research has been facilitated by publicly available datasets of videos of people, each labeled with the dominant emotion of the scene. In this work, a multimodal technique classifies scenes by emotional expression from such videos by extracting video frames, audio, and transcribed text. We investigated ways to improve performance and efficiency at each stage of the classification process, focusing on developing and refining the preprocessing stages for each input type. This allowed us to achieve 89% accuracy on a commonly used dataset using a combination of video, audio, and text.
dc.identifier.citation: Kazikhan, M. (2025). Multimodal Machine Learning for Emotion Recognition. Nazarbayev University School of Engineering and Digital Sciences
dc.identifier.uri: https://nur.nu.edu.kz/handle/123456789/8714
dc.language.iso: en
dc.publisher: Nazarbayev University School of Engineering and Digital Sciences
dc.rights: Attribution-NonCommercial-ShareAlike 3.0 United States
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/3.0/us/
dc.subject: type of access: open access
dc.subject: Emotion Recognition
dc.subject: Multimodal Learning
dc.subject: Deep Learning
dc.subject: Image Processing
dc.subject: Intention Estimation
dc.title: MULTIMODAL MACHINE LEARNING FOR EMOTION RECOGNITION
dc.type: Master's thesis

Files

Original bundle

Name: MultimodalMachineLearningforEmotionRecognition.pdf
Size: 6.12 MB
Format: Adobe Portable Document Format
Description: Master's Thesis