Semantic-Aware Image Compression with Masking and Generative Reconstruction for Edge Deployment

dc.contributor.advisorZorbas, Dimitrios
dc.contributor.authorAitymbetov, Nurmukhammed
dc.date.accessioned2026-06-02T06:08:22Z
dc.date.issued2026-05-05
dc.description.abstractSurveillance cameras are increasingly deployed at the network edge, continuously streaming high-resolution imagery to remote servers for processing and analysis. This becomes an issue in remote locations served by network links where the available bandwidth is severely constrained. To fit images within such limits, aggressive compression is required. Standard codecs such as JPEG, however, treat all image regions equally, degrading vehicles, pedestrians and road markings too, alongside the sky and vegetation that carry no useful information. The result is that the information from the regions most critical to downstream tasks is lost to visual artifacts. This thesis presents Masking-based Semantic-aware Image Compression (MSIC), a framework that solves this problem without requiring any changes to the existing codecs. On the sender side, a semantic segmentation model guides a patch-level masking step that zeroes out low-importance regions before JPEG encoding. Zero-valued patches encode to nearly zero DCT coefficients and therefore can be encoded with fewer bits, resulting in significant compression savings. On the receiver side, an autoencoder-based inpainting neural network fills in the masked regions from surrounding context. This design is intentionally asymmetrical: the sender stays lightweight to run on edge hardware without GPU acceleration, while the heavy reconstruction is run on the server side. The primary advantage of this design is that it allows MSIC to achieve compression gains from less important regions of the image, while maintaining high quality for more important regions regardless of compression levels. The framework is evaluated on the Cityscapes dataset against standard JPEG and SQ-GAN, a recent learned semantic compression baseline. At matched bitrate, the proposed method leads on structural similarity and preserves vehicles, road markings and other regions of interest at full quality. Therefore, the tradeoff favors MSIC in the type of content that downstream surveillance applications rely heavily upon.
dc.identifier.citationAitymbetov, N. (2026). Semantic-Aware Image Compression with Masking and Generative Reconstruction for Edge Deployment. Nazarbayev University School of Engineering and Digital Sciences
dc.identifier.urihttps://nur.nu.edu.kz/handle/123456789/18828
dc.language.isoen
dc.publisherNazarbayev University School of Engineering and Digital Sciences
dc.rightsAttribution 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/us/
dc.subjectImage Compression
dc.subjectSemantic Communications
dc.subjectNeural Image Reconstruction
dc.subjectImage Inpainting
dc.titleSemantic-Aware Image Compression with Masking and Generative Reconstruction for Edge Deployment
dc.typeMaster`s thesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MSc_Thesis_Nurmukhammed_Aitymbetov.pdf
Size:
10.38 MB
Format:
Adobe Portable Document Format
Access status: Embargo until 2028-05-25 , Download