SarcomaNet: A Privacy-Preserving Multi-Task Deep Learning Framework for Soft Tissue Sarcoma Analysis from Multi-Modal Imaging


Publisher

Nazarbayev University School of Engineering and Digital Sciences

Abstract

Soft tissue sarcomas (STSs) are a rare and biologically heterogeneous group of malignancies comprising more than 100 histological subtypes, with an annual incidence of fewer than 5 per 100,000 persons. Their rarity makes large, well-annotated imaging datasets difficult to assemble, limiting the use of deep learning for automated tumour segmentation, histological grade classification, and survival risk estimation, three clinically relevant tasks that are often performed separately and with limited quantitative imaging support. This thesis presents SarcomaNet, a unified, privacy-preserving tri-head deep learning framework that addresses all three tasks jointly from multi-modal imaging using the publicly available TCIA Soft Tissue Sarcoma dataset (𝑁 = 51). SarcomaNet is built on a shared 3D Residual U-Net (ResU-Net) encoder–decoder backbone and introduces four complementary innovations to address the constraints of rare-cancer imaging. (1) Cross-Modal Masked Autoencoding (CrMAE) is a self-supervised pre-training strategy that leverages the complementary biophysics of multi-modal imaging by masking a randomly selected modality channel (T2FS MRI, FDG-PET, or CT) at 50% patch density and training the encoder to reconstruct the masked modality from the two visible channels. Unlike spatial masked autoencoders, CrMAE encourages the encoder to learn physics-grounded cross-modal correspondences (oedema ↔ metabolic activity ↔ tissue density) that are directly informative of tumour biology. Over 40 pre-training epochs on all 51 unlabelled patient volumes, CrMAE reduces reconstruction MSE by 87.7% without requiring external data...
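The cross-modal masking step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the thesis implementation: the function names `cross_modal_mask` and `masked_mse`, the channel order, the patch size of 4 voxels, zero-filling of hidden patches, and scoring the reconstruction only on the hidden voxels are all assumptions made for illustration. Only the ideas of picking one modality channel at random and hiding 50% of its patches come from the abstract.

```python
import numpy as np

# Assumed channel order; the abstract names the modalities but not their order.
MODALITIES = ("T2FS_MRI", "FDG_PET", "CT")


def cross_modal_mask(volume, patch=4, mask_ratio=0.5, rng=None):
    """Hide `mask_ratio` of the patches in one randomly chosen modality channel.

    volume: float array of shape (C, D, H, W); D, H, W must be divisible by `patch`.
    Returns (masked_volume, channel_index, voxel_mask), where voxel_mask is a
    boolean (D, H, W) array that is True wherever the chosen channel was hidden.
    """
    rng = np.random.default_rng() if rng is None else rng
    c, d, h, w = volume.shape
    if d % patch or h % patch or w % patch:
        raise ValueError("spatial dimensions must be divisible by the patch size")

    # Draw the hidden patches on the coarse patch grid, without replacement.
    grid = (d // patch, h // patch, w // patch)
    n_patches = grid[0] * grid[1] * grid[2]
    n_hidden = int(round(mask_ratio * n_patches))
    flat = np.zeros(n_patches, dtype=bool)
    flat[rng.choice(n_patches, size=n_hidden, replace=False)] = True

    # Expand the patch-level mask to voxel resolution.
    voxel_mask = flat.reshape(grid)
    for axis in range(3):
        voxel_mask = voxel_mask.repeat(patch, axis=axis)

    # Zero-fill the hidden patches in one channel; the others stay fully visible.
    channel = int(rng.integers(c))
    masked = volume.copy()
    masked[channel, voxel_mask] = 0.0
    return masked, channel, voxel_mask


def masked_mse(reconstruction, target, voxel_mask):
    """Mean squared error restricted to the hidden voxels (an assumed loss choice)."""
    diff = reconstruction[voxel_mask] - target[voxel_mask]
    return float(np.mean(diff ** 2))


# Toy usage on random data standing in for a co-registered 3-channel volume.
rng = np.random.default_rng(0)
vol = rng.normal(size=(len(MODALITIES), 8, 8, 8))
masked, ch, mask = cross_modal_mask(vol, patch=4, mask_ratio=0.5, rng=rng)
print(MODALITIES[ch], "hidden fraction:", mask.mean())
```

In a pre-training loop, an encoder–decoder would receive `masked` and be trained to minimise `masked_mse` between its output for channel `ch` and `vol[ch]`, so that the hidden modality has to be inferred from the two visible ones.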

Citation

Rasool, M. H. (2026). SarcomaNet: A Privacy-Preserving Multi-Task Deep Learning Framework for Soft Tissue Sarcoma Analysis from Multi-Modal Imaging. Nazarbayev University School of Engineering and Digital Sciences.

Creative Commons license

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 3.0 United States