DSpace Repository

Analytics of Heterogeneous Breast Cancer Data Using Neuroevolution

Система будет остановлена для регулярного обслуживания. Пожалуйста, сохраните рабочие данные и выйдите из системы.

Show simple item record

dc.contributor.author Abdikenov, Beibit
dc.contributor.author Iklassov, Zangir
dc.contributor.author Sharipov, Askhat
dc.contributor.author Hussain, Shahid
dc.contributor.author Jamwal, Prashant K.
dc.date.accessioned 2019-12-11T03:54:33Z
dc.date.available 2019-12-11T03:54:33Z
dc.date.issued 2019-02-01
dc.identifier.citation Abdikenov, B., Iklassov, Z., Sharipov, A., Hussain, S., & Jamwal, P. K. (2019). Analytics of Heterogeneous Breast Cancer Data Using Neuroevolution. IEEE Access, 7, 18050–18060. https://doi.org/10.1109/access.2019.2897078 en_US
dc.identifier.issn 2169-3536
dc.identifier.uri http://nur.nu.edu.kz/handle/123456789/4341
dc.description https://ieeexplore.ieee.org/document/8632897 en_US
dc.description.abstract Breast cancer prognostic modeling is difficult since it is governed by many diverse factors. Given the low median survival and large scale breast cancer data, which comes from high throughput technology, the accurate and reliable prognosis of breast cancer is becoming increasingly difficult. While accurate and timely prognosis may save many patients from going through painful and expensive treatments, it may also help oncologists in managing the disease more efficiently and effectively. Data analytics augmented by machine-learning algorithms have been proposed in past for breast cancer prognosis; and however, most of these could not perform well owing to the heterogeneous nature of available data and model interpretability related issues. A robust prognostic modeling approach is proposed here whereby a Pareto optimal set of deep neural networks (DNNs) exhibiting equally good performance metrics is obtained. The set of DNNs is initialized and their hyperparameters are optimized using the evolutionary algorithm, NSGAIII. The final DNN model is selected from the Pareto optimal set of many DNNs using a fuzzy inferencing approach. Contrary to using DNNs as the black box, the proposed scheme allows understanding how various performance metrics (such as accuracy, sensitivity, F1, and so on) change with changes in hyperparameters. This enhanced interpretability can be further used to improve or modify the behavior of DNNs. The heterogeneous breast cancer database requires preprocessing for better interpretation of categorical variables in order to improve prognosis from classifiers. Furthermore, we propose to use a neural network-based entity-embedding method for categorical features with high cardinality. This approach can provide a vector representation of categorical features in multidimensional space with enhanced interpretability. It is shown with evidence that DNNs optimized using evolutionary algorithms exhibit improved performance over other classifiers mentioned in this paper. en_US
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers en_US
dc.rights Attribution-NonCommercial-ShareAlike 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/us/ *
dc.subject Breast cancer prognostic modelling en_US
dc.subject entity embedding en_US
dc.subject deep learning networks en_US
dc.subject evolutionary algorithms en_US
dc.subject fuzzy inferencing en_US
dc.title Analytics of Heterogeneous Breast Cancer Data Using Neuroevolution en_US
dc.type Article en_US
workflow.import.source science


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-ShareAlike 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 3.0 United States