AI-BASED MULTIMODAL EMOTION RECOGNITION SYSTEM

Loading...
Thumbnail Image

Journal Title

Journal ISSN

Volume Title

Publisher

Nazarbayev University School of Engineering and Digital Sciences

Abstract

The project built an AI system for emotion recognition which integrated video data-processing with audio analysis as well as textual information assessment. Real-time facial emotion detection through Vision Transformers (ViT) and speech emotion recognition through Wav2Vec2 made up the core targets of the project with the aim of their integration. The project overcame dataset problems along with scope adjustments by concentrating on processing video and audio content instead of text analysis. The ultimate version of the prototype shows 90% accuracy in detecting emotions across high-definition video material thus creating a new framework which benefits applications in the areas of service interaction and psychiatric assessment.

Description

Citation

Sapa, N., Ospanov, A., Zhexembeyev, T., Sultangazy, I. (2025). AI-based multimodal emotion recognition system. Nazarbayev University School of Engineering and Digital Sciences

Endorsement

Review

Supplemented By

Referenced By

Creative Commons license

Except where otherwised noted, this item's license is described as Attribution-ShareAlike 3.0 United States