Semantic-Aware Image Compression with Masking and Generative Reconstruction for Edge Deployment
| dc.contributor.advisor | Zorbas, Dimitrios | |
| dc.contributor.author | Aitymbetov, Nurmukhammed | |
| dc.date.accessioned | 2026-06-02T06:08:22Z | |
| dc.date.issued | 2026-05-05 | |
| dc.description.abstract | Surveillance cameras are increasingly deployed at the network edge, continuously streaming high-resolution imagery to remote servers for processing and analysis. This becomes an issue in remote locations served by network links where the available bandwidth is severely constrained. To fit images within such limits, aggressive compression is required. Standard codecs such as JPEG, however, treat all image regions equally, degrading vehicles, pedestrians and road markings too, alongside the sky and vegetation that carry no useful information. The result is that the information from the regions most critical to downstream tasks is lost to visual artifacts. This thesis presents Masking-based Semantic-aware Image Compression (MSIC), a framework that solves this problem without requiring any changes to the existing codecs. On the sender side, a semantic segmentation model guides a patch-level masking step that zeroes out low-importance regions before JPEG encoding. Zero-valued patches encode to nearly zero DCT coefficients and therefore can be encoded with fewer bits, resulting in significant compression savings. On the receiver side, an autoencoder-based inpainting neural network fills in the masked regions from surrounding context. This design is intentionally asymmetrical: the sender stays lightweight to run on edge hardware without GPU acceleration, while the heavy reconstruction is run on the server side. The primary advantage of this design is that it allows MSIC to achieve compression gains from less important regions of the image, while maintaining high quality for more important regions regardless of compression levels. The framework is evaluated on the Cityscapes dataset against standard JPEG and SQ-GAN, a recent learned semantic compression baseline. At matched bitrate, the proposed method leads on structural similarity and preserves vehicles, road markings and other regions of interest at full quality. Therefore, the tradeoff favors MSIC in the type of content that downstream surveillance applications rely heavily upon. | |
| dc.identifier.citation | Aitymbetov, N. (2026). Semantic-Aware Image Compression with Masking and Generative Reconstruction for Edge Deployment. Nazarbayev University School of Engineering and Digital Sciences | |
| dc.identifier.uri | https://nur.nu.edu.kz/handle/123456789/18828 | |
| dc.language.iso | en | |
| dc.publisher | Nazarbayev University School of Engineering and Digital Sciences | |
| dc.rights | Attribution 3.0 United States | en |
| dc.rights.uri | http://creativecommons.org/licenses/by/3.0/us/ | |
| dc.subject | Image Compression | |
| dc.subject | Semantic Communications | |
| dc.subject | Neural Image Reconstruction | |
| dc.subject | Image Inpainting | |
| dc.title | Semantic-Aware Image Compression with Masking and Generative Reconstruction for Edge Deployment | |
| dc.type | Master`s thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- MSc_Thesis_Nurmukhammed_Aitymbetov.pdf
- Size:
- 10.38 MB
- Format:
- Adobe Portable Document Format