REAL-TIME SPEECH EMOTION RECOGNITION (RSER) IN WIRELESS MULTIMEDIA SENSOR NETWORKS

dc.contributor.authorZhilibayev, Serik
dc.date.accessioned2021-08-02T10:17:48Z
dc.date.available2021-08-02T10:17:48Z
dc.date.issued2021-08
dc.description.abstractRecently Wireless Multimedia Sensor Networks (WMSN) is extensively used and huge amounts of data are generated on daily basis. There are huge processes that have to be monitored in real-time, so preprocessing and fast analysis of raw data is required to be done and stored on the edge. Since, edge computation allows the environment to be decentralized, which makes it highly responsive, low price, scalable, and secure. WMSN and edge computing are important in areas like healthcare where the subject has to be monitored and analyzed continuously. In this work, we propose the healthcare system for monitoring human emotion using speech in realtime (RSER). Firstly, this project aims to analyze state-of-the are SER approaches with respect to time and the ability to work on constrained devices. Secondly, the new approach based on time analysis will be provided. There will be Exploratory data Analysis on multiple datasets that will be used for training such as the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) , Berlin (EMO-DB) and IMEOCAP datasets. Data based on Vocal tract spectrum features and low-level acoustic features (Pitch and energy) will be extracted. The data will be trained and evaluated on Deep Learning and Machine Learning algorithms. Algorithms will be prioritized by their time, energy, and accuracy metrics. Then, this experiment will be tested and evaluated on embedded device (Raspberry PI). Finally, modified model based on algorithm analysis will be tested on 3 Scenarios (Processing on Edge, Processing Sink, and Streaming).en_US
dc.identifier.citation"Zhilibayev, S. (2021). Real-time Speech Emotion Recognition (RSER) in Wireless Multimedia Sensor Networks (Unpublished master's thesis). Nazarbayev University, Nur-Sultan, Kazakhstan"en_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/5642
dc.language.isoenen_US
dc.publisherNazarbayev University School of Engineering and Digital Sciencesen_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjectRSERen_US
dc.subjectWireless Multimedia Sensor Networksen_US
dc.subjectWMSNen_US
dc.subjectRyerson Audio-Visual Database of Emotional Speech and Songen_US
dc.subjectReal-time Speech Emotion Recognitionen_US
dc.subjecttype of access: open accessen_US
dc.subjectResearch Subject Categories::TECHNOLOGYen_US
dc.titleREAL-TIME SPEECH EMOTION RECOGNITION (RSER) IN WIRELESS MULTIMEDIA SENSOR NETWORKSen_US
dc.typeMaster's thesisen_US
workflow.import.sourcescience

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Thesis - Serik Zhilibayev.pdf
Size:
3.41 MB
Format:
Adobe Portable Document Format
Description:
Thesis
Loading...
Thumbnail Image
Name:
Presentation - Serik Zhilibayev.pptx
Size:
3.6 MB
Format:
Microsoft Powerpoint XML
Description:
Presentation