LAUNCH OF Q-SYMPHONY BIOINFORMATICS COMPUTING SYSTEM: A HIGH-PERFORMANCE CLUSTER FOR ANALYSIS OF LARGE-SCALE GENOMIC DATASETS

dc.contributor.authorMolkenov, A.
dc.contributor.authorDaniyarov, A.
dc.contributor.authorSharip, A.
dc.contributor.authorSeisenova, A.
dc.contributor.authorKarabayev, D.
dc.contributor.authorKairov, U.
dc.date.accessioned2020-11-23T03:08:10Z
dc.date.available2020-11-23T03:08:10Z
dc.date.issued2020
dc.description.abstractIntroduction: One whole human genome, provided by next generation sequencing platforms, in raw format takes 20 to 50 GB. In the course of bioinformatics analysis and data analysis, the data volume increases to 300-500 GB per genome. with an increase in the number of samples, the occupied volume increases. Such a large amount of data required for the analysis of whole genomes demands powerful computing power in the form of servers and data warehouses combined into clusters. We at Laboratory of Bioinformatics and Systems Biology have developed and launched Q-Symphony bioinformatics computing system called (“Qazaq Symphony of Bioinformatics”) for bioinformatics analyses of solving large scale genomic datasets. Materials and methods: The Q-Symphony bioinformatics computing system consists 12high-performance HPE servers: 1control node, 8 compute nodes, 1fat-memory compute node, and 2storage nodes. The system runs on Red Hat Enterprise Linux. The management node controls access to user profiles, data warehouse and Moab Workload Manager. The total number of processing cores is 172, the total amount of RAM is 3072GB, and the total storage capacity is 198 TB, a peak performance of the system of 7.3 TFlops. All nodes use high-speed Infiniband network connections, which allow the data exchange between nodes at 100 Gbps speed. The computational capabilities of the Q-symphony system allow us to evenly distribute resources for each task performed, monitor the load on processor and memory resources in real time, and queue and execute sequentially large lists of tasks. Results: Benchmark measurements performed on Q-symphony system showed an increase of subtasks execution from 15 to 54 times compared to standard solutions built on similar computational processors. Conclusion: The presence of Q-Symphony, well-established and proven bioinformatics methods will make it possible to successfully analyze large-scale human genomic data and determine structural genomic variants and carry out complex comparative and population analysis.en_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/5124
dc.language.isoenen_US
dc.publisherInternational conference "MODERN PERSPECTIVES FOR BIOMEDICAL SCIENCES: FROM BENCH TO BEDSIDE”; National Laboratory Astanaen_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjectbioinformaticsen_US
dc.subjectnext-generation sequencingen_US
dc.subjectwhole genome analysisen_US
dc.subjectResearch Subject Categories::MEDICINEen_US
dc.titleLAUNCH OF Q-SYMPHONY BIOINFORMATICS COMPUTING SYSTEM: A HIGH-PERFORMANCE CLUSTER FOR ANALYSIS OF LARGE-SCALE GENOMIC DATASETSen_US
dc.typeAbstracten_US
workflow.import.sourcescience

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Abstract 30.pdf
Size:
197.68 KB
Format:
Adobe Portable Document Format
Description:
Abstract

Collections