Anticancer Peptides Classification Using Long-Short-Term Memory with Novel Feature Representation

dc.contributor.authorNazer Al Tahifah
dc.contributor.authorMuhammad Sohail Ibrahim
dc.contributor.authorErum Rehman
dc.contributor.authorNaveed Ahmed
dc.contributor.authorAbdul Wahab
dc.contributor.authorShujaat Khan
dc.date.accessioned2025-08-26T11:25:34Z
dc.date.available2025-08-26T11:25:34Z
dc.date.issued2024-01-01
dc.description.abstractCancer presents a formidable challenge due to its complexity, variability, and multitude of causes. Despite being extensively studied, our understanding of cancer remains incomplete. This underscores the urgent need for comprehensive therapeutic strategies. Among the potential treatment avenues, anticancer peptides (ACPs) hold significant promise. However, the identification and synthesis of these peptides on a large scale pose ongoing challenges, necessitating the development of reliable prediction methods. Existing methods for predicting ACPs often suffer from low accuracy and rely on features with limited resolution. To address this, we propose a novel classification approach based on long short‑term memory (LSTM) networks, utilizing a new set of features. This feature set comprises both contemporary and innovative extraction techniques. The contemporary features include binary profile features and k‑mer sparse matrices of reduced amino acids. The novel features are derived from the Composition of the K‑Spaced Side Chain Pairs (CKSSCP), the Composition of the K‑Spaced Electrically Charged Side Chain Pairs (CKSECSCP), and a combination of [pk(CO₂H)] + [pk(NH₂)] + [pk(R)] + [Isoelectric point]. The combined feature set is employed to train the LSTM model, and extensive experiments are conducted on benchmark datasets using k‑fold cross‑validation. The results demonstrate that our model surpasses other ACP classification methods in terms of accuracy and Matthews correlation coefficient (MCC). Specifically, for the ACP740 dataset with 5 folds, we achieve an MCC of 74.61%, outperforming ACP‑KSRC, ACP‑MHCNN, and ACP‑DA by 7.61%, 2.61%, and 10.61%, respectively. For the ACP344 dataset, the MCC reaches 84.46%, surpassing ACP‑KSRC and the ZH‑method by 3.46% and 6.46%, respectively. With its superior classification performance, our proposed model could aid in identifying novel ACPs and contribute to a deeper understanding of their structural and chemical characteristics.en
dc.identifier.citationAl Tahifah Nazer, Sohail Ibrahim Muhammad, Rehman Erum, Ahmed Naveed, Wahab Abdul, Khan Shujaat. (2025). Anticancer Peptides Classification Using Long-Short-Term Memory With Novel Feature Representation. IEEE Access. https://doi.org/10.1109/access.2024.3523068en
dc.identifier.doi10.1109/access.2024.3523068
dc.identifier.urihttps://doi.org/10.1109/access.2024.3523068
dc.identifier.urihttps://nur.nu.edu.kz/handle/123456789/10273
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.rightsOpen accessen
dc.source(2024)en
dc.subjectTerm (time)en
dc.subjectComputer scienceen
dc.subjectRepresentation (politics)en
dc.subjectFeature (linguistics)en
dc.subjectArtificial intelligenceen
dc.subjectLong short term memoryen
dc.subjectPattern recognition (psychology)en
dc.subjectNatural language processingen
dc.subjectMachine learningen
dc.subjectArtificial neural networken
dc.subjectRecurrent neural networken
dc.subjectLinguisticsen
dc.subjectPhilosophyen
dc.subjectPhysicsen
dc.subjectQuantum mechanicsen
dc.subjectPoliticsen
dc.subjectPolitical scienceen
dc.subjectLaw; type of access: open accessen
dc.titleAnticancer Peptides Classification Using Long-Short-Term Memory with Novel Feature Representationen
dc.typearticleen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
10.1109_ACCESS.2024.3523068.pdf
Size:
3.27 MB
Format:
Adobe Portable Document Format

Collections