dc.contributor.author | Mussagaliyev, Bibarys | |
dc.date.accessioned | 2022-06-10T10:45:50Z | |
dc.date.available | 2022-06-10T10:45:50Z | |
dc.date.issued | 2022-05 | |
dc.identifier.citation | Mussagaliyev, B. (2022). Optimization of the Real-Time State-of-the-Art YOLOv4 Object Detector By Modified Neck Structure (Unpublished master's thesis). Nazarbayev University, Nur-Sultan, Kazakhstan | en_US |
dc.identifier.uri | http://nur.nu.edu.kz/handle/123456789/6235 | |
dc.description.abstract | The state-of-the-art YOLOv4 object detector has already demonstrated fast inference (65 frames per second (FPS) on a V100 Tesla) and relatively high accuracy on the MSCOCO dataset (mAP 43.5%) in real-time mode. Moreover, the simplicity of training and testing the model is another advantage for the machine learning community. The ability to train the model as a unified system on a single graphics processing unit (GPU) has established it as a milestone in the real-time object detection field. This work reviews the fundamental and most recent academic work in the field and proposes incremental research towards the optimization of the YOLOv4 architecture. We propose a model, named SAMD-YOLOv4, with a modified neck structure that reduces the number of learnable parameters by decreasing the number of filters with a 1×1 kernel, followed by a spatial attention module and dilated convolutional layers. We demonstrate that the method reduces the model’s complexity by 7.3% with no effect on the model’s precision, and lowers inference time by 6.9%. In the chapters below, we provide experimental results and a comparison study of the baseline YOLOv4 and our SAMD-YOLOv4. Furthermore, TensorRT-based inference results are presented and studied. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Nazarbayev University School of Engineering and Digital Sciences | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | * |
dc.subject | SAMD-YOLOv4 | en_US |
dc.subject | frames per second | en_US |
dc.subject | FPS | en_US |
dc.subject | V100 Tesla | en_US |
dc.subject | MSCOCO dataset | en_US |
dc.subject | Optimization | en_US |
dc.subject | Modified Neck Structure | en_US |
dc.subject | Object Detector | en_US |
dc.subject | Research Subject Categories::TECHNOLOGY | en_US |
dc.subject | Type of access: Open Access | en_US |
dc.title | OPTIMIZATION OF THE REAL-TIME STATE-OF-THE-ART YOLOV4 OBJECT DETECTOR BY MODIFIED NECK STRUCTURE | en_US |
dc.type | Master's thesis | en_US |
workflow.import.source | science |