TOWARDS AUTOMATED MOLECULAR SEARCH IN DRUG SPACE

dc.contributor.authorZhumagambetov, Rustam
dc.date.accessioned2021-07-01T10:41:04Z
dc.date.available2021-07-01T10:41:04Z
dc.date.issued2021-05
dc.description.abstractRecent advances in convolutional neural networks have inspired the application of deep learning to other disciplines. Even though image processing and natural language processing have turned out to be the most successful, there are many other areas that have benefited, like computational chemistry in general and drug design in particular. From 2018 the scientific community has seen a surge of methodologies related to the generation of diverse molecular libraries using machine learning. The first goal is to provide an accessible way of using machine learning algorithms to chemists without technical knowledge. Hence, cheML.io, a web database that contains virtual molecules generated by 10 recent ML algorithms, is proposed. It allows users to browse the data in a user-friendly and convenient manner. ML-generated molecules with desired structures and properties can be retrieved with the help of a drawing widget. For the case of a specific search leading to insufficient results, users are able to create new molecules on demand. The second goal is to develop an algorithm that allows the generation of diverse focused libraries utilizing one, or two seed molecules which guide the generation of de novo molecules. Here a variant of transformers, an architecture recently developed for natural language processing, was employed for this purpose. The results indicate that this model is indeed applicable for the task of generating focussed molecular libraries and leads to statistically significant increases in some of the core metrics of the MOSES benchmark. A benchmark that provides baselines and metrics that can characterize the main attributes of the algorithms by examining the generated molecules. In addition, a novel way of generating libraries where two seed molecules can be fused is introduced.en_US
dc.identifier.citationZhumagambetov, R. (2021). Towards Automated Molecular Search in Drug Space (Unpublished master's thesis). Nazarbayev University, Nur-Sultan, Kazakhstanen_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/5495
dc.language.isoenen_US
dc.publisherNazarbayev University School of Engineering and Digital Sciencesen_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjectResearch Subject Categories::TECHNOLOGYen_US
dc.subjectType of access: Embargoen_US
dc.subjectdrug spaceen_US
dc.subjectMOSESen_US
dc.subjectde novo moleculesen_US
dc.titleTOWARDS AUTOMATED MOLECULAR SEARCH IN DRUG SPACEen_US
dc.typeMaster's thesisen_US
workflow.import.sourcescience

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Thesis - Rustam Zhumagambetov.pdf
Size:
4.67 MB
Format:
Adobe Portable Document Format
Description:
Thesis
Loading...
Thumbnail Image
Name:
Presentation - Rustam Zhumagambetov.pptx
Size:
7.73 MB
Format:
Microsoft Powerpoint XML
Description:
Presentation