dc.contributor.author | Meshituly, Maksat | |
dc.date.accessioned | 2022-07-13T08:11:13Z | |
dc.date.available | 2022-07-13T08:11:13Z | |
dc.date.issued | 2022-04 | |
dc.identifier.citation | Meshituly, M. (2022). A pipeline for protein-ligand interaction analysis (Unpublished master's thesis). Nazarbayev University, Nur-Sultan, Kazakhstan | en_US |
dc.identifier.uri | http://nur.nu.edu.kz/handle/123456789/6415 | |
dc.description.abstract | The drug discovery and development is a very complex, time and cost intensive process with multiple steps. One of the key steps in this process is the identification of the binding sites between protein and ligands. There are various data resources that can serve in the step of high throughput in silico screening, a process for drug candidate search such as to find a molecule for specific target that has the required chemical and biological properties, and can be used in the further downstream process of the drug development. These large databases in combination with big data processing methods, such as data mining, data fusion and data integration, are attracting much attention from various scientific communities in studying the problem of the automation of drug design. Big data processing methods and techniques are efficient and capable of implementing screening for molecular properties and drug design for millions of chemical compounds. In particular, the automated analysis and prediction methods of protein binding sites and potential ligand conformations can accelerate and effectively advance the drug development process. There are many gaps that would need an improvement in the current automated drug design in particular for the processing and analysis of protein-ligand interaction that would increase the accuracy of the final prediction. Some of the areas for development would be the removal of inaccuracies and errors in the initial dataset creation and processing for the used training sets of compounds. These problems are critical to solve because mistakes can lead to serious health and economic consequences such as harm to patients and severe financial losses for the stakeholders. In the proposed approach, a multi-step pipeline for protein-ligand interaction analysis based on the large dataset of compounds from the Protein Data Bank was considered. The result of this pipeline would ultimately provide a practical way to manipulate large-scale chemical data using familiar software for specialists in molecular chemistry without consuming large amounts of computing power. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Nazarbayev University School of Engineering and Digital Sciences | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | * |
dc.subject | Type of access: Gated Access | en_US |
dc.subject | protein-ligand interaction | en_US |
dc.subject | pipeline | en_US |
dc.subject | Protein Data Bank | en_US |
dc.subject | SMILES | en_US |
dc.subject | PyMOL RMSD | en_US |
dc.subject | Research Subject Categories::TECHNOLOGY | en_US |
dc.title | A PIPELINE FOR PROTEIN-LIGAND INTERACTION ANALYSIS | en_US |
dc.type | Master's thesis | en_US |
workflow.import.source | science |
The following license files are associated with this item: