Enhancing skeleton-based action recognition with hybrid real and gan-generated datasets
Loading...
Date
Journal Title
Journal ISSN
Volume Title
Publisher
Private Company Technology Center
Abstract
This research addresses the critical challenge of recognizing mutual actions involving multiple individuals, an important task for applications such as video surveillance, human-computer interaction, autonomous systems, and behavioral analysis. Identifying these actions from 3D skeleton motion sequences poses significant challenges due to the necessity of accurately capturing intricate spatial and temporal patterns in diverse, dynamic, and often unpredictable environments. To tackle this, a robust neural network framework was developed that combines Convolutional Neural Networks (CNNs) for efficient spatial feature extraction with Long Short-Term Memory (LSTM) networks to model temporal dependencies over extended sequences. A distinguishing feature of this study is the creation of a hybrid dataset that which combines real-world skeleton motion data with synthetically generated samples, produced using Generative Adversarial Networks (GANs). This dataset enriches variability, enhances generalization, and mitigates data scarcity challenges. Experimental findings across three different network architectures demonstrate that our method significantly enhances recognition accuracy, mainly due to the integration of CNNs and LSTMs alongside the broadened dataset. Our approach successfully identifies complex interactions and ensures consistent performance across different perspectives and environmental conditions. The improved reliability in recognition indicates that this framework can be effectively utilized in practical applications such as security systems, crowd monitoring, and other areas where precise detection of mutual actions is critical, particularly in real-time and dynamic environments
Description
Keywords
Computer science, Artificial intelligence, Generalization, Convolutional neural network, Machine learning, Feature (linguistics), Reliability (semiconductor), Task (project management), Pattern recognition (psychology), Data mining, Physics, Quantum mechanics, Mathematical analysis, Linguistics, Philosophy, Power (physics), Mathematics, Management, Economics, type of access: open access
Citation
Islamgozhayev Talgat, Amirgaliyev Beibut, Kozhirbayev Zhanibek. (2024). Enhancing skeleton-based action recognition with hybrid real and gan-generated datasets. Eastern-European Journal of Enterprise Technologies. https://doi.org/https://doi.org/10.15587/1729-4061.2024.317092