OPTIMIZATION OF THE REAL-TIME STATE-OF-THE-ART YOLOV4 OBJECT DETECTOR BY MODIFIED NECK STRUCTURE

dc.contributor.authorMussagaliyev, Bibarys
dc.date.accessioned2022-06-10T10:45:50Z
dc.date.available2022-06-10T10:45:50Z
dc.date.issued2022-05
dc.description.abstractThe state-of-the-art YOLOv4 object detector has already demonstrated its effective inference (65 frames per second (FPS) on V100 Tesla) and relatively high accuracy on MSCOCO dataset (mAP 43.5 %) in real-time mode. Moreover, simplicity of the model’s training and testing appears as another advantage for machine learning community. The ability of the model to be learned as a unified system on just a single graphic processing unit (GPU) unsurprisingly established itself as the milestone in the real-time object detection field. This work aims to review the fundamental and most recent academic work in the field and suggest the incremental research towards the optimization of the YOLOv4 architecture. We propose a model, named SAMD-YOLOv4, with modified neck structure, which reduces number of learning parameters by decreased number of filters with 1×1 kernel, which is followed by spatial attention module and dilated convolutional layers. We demonstrate that method is capable to reduce model’s complexity by 7.3% with no effect on model’s precision as well as lowered inference time by 6.9%. In Chapters below, we provide experimental results and comparison study on baseline YOLOv4 and our SAMD-YOLOv4. Furthermore, the TensorRT-based inference’s results will be revealed and studied.en_US
dc.identifier.citationMussagaliyev, B. (2022). Optimization of the Real-Time State-of-the-Art YOLOv4 Object Detector By Modified Neck Structure (Unpublished master's thesis). Nazarbayev University, Nur-Sultan, Kazakhstanen_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/6235
dc.language.isoenen_US
dc.publisherNazarbayev University School of Engineering and Digital Sciencesen_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjectSAMD-YOLOv4en_US
dc.subjectframes per seconden_US
dc.subjectFPSen_US
dc.subjectV100 Teslaen_US
dc.subjectMSCOCO dataseten_US
dc.subjectOptimizationen_US
dc.subjectModified Neck Structureen_US
dc.subjectObject Detectoren_US
dc.subjectResearch Subject Categories::TECHNOLOGYen_US
dc.subjecttype of access: open accessen_US
dc.titleOPTIMIZATION OF THE REAL-TIME STATE-OF-THE-ART YOLOV4 OBJECT DETECTOR BY MODIFIED NECK STRUCTUREen_US
dc.typeMaster's thesisen_US
workflow.import.sourcescience

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Thesis - Bibarys Mussagaliyev.pdf
Size:
9.52 MB
Format:
Adobe Portable Document Format
Description:
Thesis
Loading...
Thumbnail Image
Name:
Presentation - Bibarys Mussagaliyev.pptx
Size:
1.96 MB
Format:
Microsoft Powerpoint XML
Description:
Presentation