ψ-VIT: HUMAN ACTIVITY RECOGNITION USING AUXILIARY TASKS-ENHANCED VIDEO TRANSFORMERS

dc.contributor.author: Kirillov, Kirill
dc.date.accessioned: 2025-06-03T09:58:55Z
dc.date.available: 2025-06-03T09:58:55Z
dc.date.issued: 2025-05-11
dc.description.abstract: Human Activity Recognition (HAR) is a critical task in healthcare: combined with IoT devices and machine learning techniques, it enables emergency detection and prevention without human supervision. Traditional unimodal approaches to HAR often fall short in accurately recognizing complex or subtle activities, whereas multimodal systems that integrate data from sensors such as accelerometers, gyroscopes, video, and audio provide richer context and higher accuracy. This work introduces the Pose- and Sensor-Induced Video Transformer (ψ-ViT), a framework that enhances HAR performance by inducing motion sensor data through auxiliary learning tasks during training while retaining the efficiency of vision-only inference. Building on the principles of the Pose Induced Video Transformer (π-ViT), our methodology extends auxiliary task learning to the gyroscope and accelerometer modalities by introducing induction modules. Experiments demonstrate that combining these modules with a video transformer backbone improves recognition of fine-grained human activities by up to 7%, particularly for subtle motions, thus advancing HAR systems toward practical healthcare deployment without requiring wearable sensors during real-world use.
dc.identifier.citation: Kirillov, K. (2025). ψ-ViT: Human Activity Recognition using Auxiliary Tasks-Enhanced Video Transformers. Nazarbayev University School of Engineering and Digital Sciences.
dc.identifier.uri: https://nur.nu.edu.kz/handle/123456789/8726
dc.language.iso: en
dc.publisher: Nazarbayev University School of Engineering and Digital Sciences
dc.rights: Attribution 3.0 United States
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/us/
dc.subject: type of access: open access
dc.title: ψ-VIT: HUMAN ACTIVITY RECOGNITION USING AUXILIARY TASKS-ENHANCED VIDEO TRANSFORMERS
dc.type: Master's thesis

Files

Original bundle

Name: thesis_signed.pdf
Size: 2.67 MB
Format: Adobe Portable Document Format
Description: Master's thesis