AI-BASED MULTIMODAL EMOTION RECOGNITION SYSTEM
Loading...
Date
Journal Title
Journal ISSN
Volume Title
Publisher
Nazarbayev University School of Engineering and Digital Sciences
Abstract
The project built an AI system for emotion recognition which integrated video data-processing with audio analysis as well as textual information assessment. Real-time facial emotion detection through Vision Transformers (ViT) and speech emotion recognition through Wav2Vec2 made up the core targets of the project with the aim of their integration. The project overcame dataset problems along with scope adjustments by concentrating on processing video and audio content instead of text analysis. The ultimate version of the prototype shows 90% accuracy in detecting emotions across high-definition video material thus creating a new framework which benefits applications in the areas of service interaction and psychiatric assessment.
Description
Citation
Sapa, N., Ospanov, A., Zhexembeyev, T., Sultangazy, I. (2025). AI-based multimodal emotion recognition system. Nazarbayev University School of Engineering and Digital Sciences
Collections
Endorsement
Review
Supplemented By
Referenced By
Creative Commons license
Except where otherwised noted, this item's license is described as Attribution-ShareAlike 3.0 United States
