Anticancer Peptides Classification Using Long-Short-Term Memory with Novel Feature Representation
| dc.contributor.author | Nazer Al Tahifah | |
| dc.contributor.author | Muhammad Sohail Ibrahim | |
| dc.contributor.author | Erum Rehman | |
| dc.contributor.author | Naveed Ahmed | |
| dc.contributor.author | Abdul Wahab | |
| dc.contributor.author | Shujaat Khan | |
| dc.date.accessioned | 2025-08-26T11:25:34Z | |
| dc.date.available | 2025-08-26T11:25:34Z | |
| dc.date.issued | 2024-01-01 | |
| dc.description.abstract | Cancer presents a formidable challenge due to its complexity, variability, and multitude of causes. Despite being extensively studied, our understanding of cancer remains incomplete. This underscores the urgent need for comprehensive therapeutic strategies. Among the potential treatment avenues, anticancer peptides (ACPs) hold significant promise. However, the identification and synthesis of these peptides on a large scale pose ongoing challenges, necessitating the development of reliable prediction methods. Existing methods for predicting ACPs often suffer from low accuracy and rely on features with limited resolution. To address this, we propose a novel classification approach based on long short‑term memory (LSTM) networks, utilizing a new set of features. This feature set comprises both contemporary and innovative extraction techniques. The contemporary features include binary profile features and k‑mer sparse matrices of reduced amino acids. The novel features are derived from the Composition of the K‑Spaced Side Chain Pairs (CKSSCP), the Composition of the K‑Spaced Electrically Charged Side Chain Pairs (CKSECSCP), and a combination of [pk(CO₂H)] + [pk(NH₂)] + [pk(R)] + [Isoelectric point]. The combined feature set is employed to train the LSTM model, and extensive experiments are conducted on benchmark datasets using k‑fold cross‑validation. The results demonstrate that our model surpasses other ACP classification methods in terms of accuracy and Matthews correlation coefficient (MCC). Specifically, for the ACP740 dataset with 5 folds, we achieve an MCC of 74.61%, outperforming ACP‑KSRC, ACP‑MHCNN, and ACP‑DA by 7.61%, 2.61%, and 10.61%, respectively. For the ACP344 dataset, the MCC reaches 84.46%, surpassing ACP‑KSRC and the ZH‑method by 3.46% and 6.46%, respectively. With its superior classification performance, our proposed model could aid in identifying novel ACPs and contribute to a deeper understanding of their structural and chemical characteristics. | en |
| dc.identifier.citation | Al Tahifah Nazer, Sohail Ibrahim Muhammad, Rehman Erum, Ahmed Naveed, Wahab Abdul, Khan Shujaat. (2025). Anticancer Peptides Classification Using Long-Short-Term Memory With Novel Feature Representation. IEEE Access. https://doi.org/10.1109/access.2024.3523068 | en |
| dc.identifier.doi | 10.1109/access.2024.3523068 | |
| dc.identifier.uri | https://doi.org/10.1109/access.2024.3523068 | |
| dc.identifier.uri | https://nur.nu.edu.kz/handle/123456789/10273 | |
| dc.language.iso | en | |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
| dc.rights | Open access | en |
| dc.source | (2024) | en |
| dc.subject | Term (time) | en |
| dc.subject | Computer science | en |
| dc.subject | Representation (politics) | en |
| dc.subject | Feature (linguistics) | en |
| dc.subject | Artificial intelligence | en |
| dc.subject | Long short term memory | en |
| dc.subject | Pattern recognition (psychology) | en |
| dc.subject | Natural language processing | en |
| dc.subject | Machine learning | en |
| dc.subject | Artificial neural network | en |
| dc.subject | Recurrent neural network | en |
| dc.subject | Linguistics | en |
| dc.subject | Philosophy | en |
| dc.subject | Physics | en |
| dc.subject | Quantum mechanics | en |
| dc.subject | Politics | en |
| dc.subject | Political science | en |
| dc.subject | Law; type of access: open access | en |
| dc.title | Anticancer Peptides Classification Using Long-Short-Term Memory with Novel Feature Representation | en |
| dc.type | article | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 10.1109_ACCESS.2024.3523068.pdf
- Size:
- 3.27 MB
- Format:
- Adobe Portable Document Format