ADVANCING BLOOD SAMPLE ANALYSIS: INCORPORATING EXPERT OPINIONS AND EXPLAINABLE AI IN MULTI-LABEL DISEASE PREDICTION

dc.contributor.authorOrynbay, Sultan
dc.contributor.authorAkanova, Inabat
dc.contributor.authorTurmakhan, Diana
dc.contributor.authorBeken, Ulpan
dc.contributor.authorSerikkazhy, Islam
dc.date.accessioned2024-06-21T08:42:21Z
dc.date.available2024-06-21T08:42:21Z
dc.date.issued2024-04-19
dc.description.abstractBlood sample analysis plays a crucial role in modern medical practice, aiding in the detection of a wide array of diseases. Despite its significance, the potential of blood samples for predicting various diseases has remained largely unexplored. Our project aimed to dive into evaluate the efficacy of blood samples in predicting a broad spectrum of disease using large-scale MIMIC III medical dataset. Given the sparse nature of the data, we combine imputation with multi-task models for which we identify and utilize meaningful auxiliary tasks and are thus able to reach an average state-of-the-art ROC-AUC score of 81% across the 50 most prevalent diseases within the dataset. To further validate our findings, we sought the expertise of five medical doctors, who independently rated the predictability of these diseases from blood samples. Spearman’s rho analysis revealed a substantial agreement ( = 0.61) between the doctors’ ratings and the actual ROCAUC values of our machine learning models. In order to add transparency and reliability, we employed the Local Interpretable Modelagnostic Explanations (LIME) method to identify the most predictive blood sample features. These findings were rigorously cross-checked with medical experts, affirming the robustness and credibility of our predictive models. Our study represents a significant advancement in the field of medical diagnostics, showcasing the untapped potential of blood sample analysis in disease prediction. By integrating cuttingedge machine learning techniques with expert validation, we pave the way for enhanced patient care and improved healthcare outcomes.en_US
dc.identifier.citationOrynbay, S., Akanova, I., Turmakhan, D., Beken, U., & Serikkazhy, I. (2024). Advancing blood sample analysis: Incorporating expert opinions and explainable AI in multi-label disease prediction. Nazarbayev University School Engineering and Digital Sciencesen_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/7936
dc.language.isoenen_US
dc.publisherNazarbayev University School Engineering and Digital Sciencesen_US
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/*
dc.subjectblood samplesen_US
dc.subjectmulti-label classificationen_US
dc.subjectmulti-task learningen_US
dc.subjectexplainable AIen_US
dc.subjectimputationen_US
dc.subjectexpert opinionen_US
dc.subjectType of access: Restricteden_US
dc.titleADVANCING BLOOD SAMPLE ANALYSIS: INCORPORATING EXPERT OPINIONS AND EXPLAINABLE AI IN MULTI-LABEL DISEASE PREDICTIONen_US
dc.typeBachelor's thesisen_US
workflow.import.sourcescience

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
final_report.pdf
Size:
537.31 KB
Format:
Adobe Portable Document Format
Description:
Bachelor thesis, report
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.28 KB
Format:
Item-specific license agreed upon to submission
Description: