DSpace Repository

A PIPELINE FOR PROTEIN-LIGAND INTERACTION ANALYSIS

Show simple item record

dc.contributor.author Meshituly, Maksat
dc.date.accessioned 2022-07-13T08:11:13Z
dc.date.available 2022-07-13T08:11:13Z
dc.date.issued 2022-04
dc.identifier.citation Meshituly, M. (2022). A pipeline for protein-ligand interaction analysis (Unpublished master's thesis). Nazarbayev University, Nur-Sultan, Kazakhstan en_US
dc.identifier.uri http://nur.nu.edu.kz/handle/123456789/6415
dc.description.abstract The drug discovery and development is a very complex, time and cost intensive process with multiple steps. One of the key steps in this process is the identification of the binding sites between protein and ligands. There are various data resources that can serve in the step of high throughput in silico screening, a process for drug candidate search such as to find a molecule for specific target that has the required chemical and biological properties, and can be used in the further downstream process of the drug development. These large databases in combination with big data processing methods, such as data mining, data fusion and data integration, are attracting much attention from various scientific communities in studying the problem of the automation of drug design. Big data processing methods and techniques are efficient and capable of implementing screening for molecular properties and drug design for millions of chemical compounds. In particular, the automated analysis and prediction methods of protein binding sites and potential ligand conformations can accelerate and effectively advance the drug development process. There are many gaps that would need an improvement in the current automated drug design in particular for the processing and analysis of protein-ligand interaction that would increase the accuracy of the final prediction. Some of the areas for development would be the removal of inaccuracies and errors in the initial dataset creation and processing for the used training sets of compounds. These problems are critical to solve because mistakes can lead to serious health and economic consequences such as harm to patients and severe financial losses for the stakeholders. In the proposed approach, a multi-step pipeline for protein-ligand interaction analysis based on the large dataset of compounds from the Protein Data Bank was considered. The result of this pipeline would ultimately provide a practical way to manipulate large-scale chemical data using familiar software for specialists in molecular chemistry without consuming large amounts of computing power. en_US
dc.language.iso en en_US
dc.publisher Nazarbayev University School of Engineering and Digital Sciences en_US
dc.rights Attribution-NonCommercial-ShareAlike 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/us/ *
dc.subject Type of access: Gated Access en_US
dc.subject protein-ligand interaction en_US
dc.subject pipeline en_US
dc.subject Protein Data Bank en_US
dc.subject SMILES en_US
dc.subject PyMOL RMSD en_US
dc.subject Research Subject Categories::TECHNOLOGY en_US
dc.title A PIPELINE FOR PROTEIN-LIGAND INTERACTION ANALYSIS en_US
dc.type Master's thesis en_US
workflow.import.source science


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-ShareAlike 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 3.0 United States