BIBLIOMETRIC ANALYSIS OF SCIENTIFIC PUBLICATIONS DEVOTED TO SUSTAINABLE DEVELOPMENT GOAL 13 USING NLP TECHNIQUES

dc.contributor.authorPentayev, Alpamys
dc.date.accessioned2022-06-10T04:34:55Z
dc.date.available2022-06-10T04:34:55Z
dc.date.issued2022-05
dc.description.abstractImprovements in search engines for traditional text documents have made it possible to retrieve massive text information with efficient use of resources efficiently. As the number of published scientific articles on climate change increases, analyzing trends and the current state of this field of science becomes a very time-consuming and time-consuming task. To address the ever-growing demands for systematic literature review techniques, rapid review and scoping reviews techniques have emerged. This work carried out statistical analysis of publications and popular NLP methods such as vectorization, metadata preprocessing, bag-of-words, TF-IDF, lemmatization, and others. We also applied the LDA topic modelling to identify topics from abstracts of scientific publications. According to the statistical analysis outcomes of a large dataset from the CORE API, we obtained the following results: (1) there was a significant increase in research since 2010 and peaked in 2016 with 865980 publications on the climate change field of study; (2) the two most productive institutions are the University of Pennsylvania and the Department of Energy, the United States for Climate Impact Research; (3) in almost all five centrality measures, Intergovernmental Panel on Climate Change is leading as the most productive author; (4) an obvious fact is the high level of involvement in climate change research of Western countries, while according to the results of the analysis, the second place was also taken by the representative of Asia - Japan. Additionally, we created citations and co-authorship networks using the Neo4j graph database and calculated five centrality measures, including Page Rank, Eigenvector centralities, and others.en_US
dc.identifier.citationPentayev, A. (2022). Bibliometric analysis of scientific publications devoted to Sustainable Development Goal 13 using NLP techniques (Unpublished master's thesis). Nazarbayev University, Nur-Sultan, Kazakhstanen_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/6199
dc.language.isoenen_US
dc.publisherNazarbayev University School of Engineering and Digital Sciencesen_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjecttype of access: gated accessen_US
dc.subjectNLP techniquesen_US
dc.subjectmassive text informationen_US
dc.subjectNatural language processingen_US
dc.subjectResearch Subject Categories::TECHNOLOGYen_US
dc.titleBIBLIOMETRIC ANALYSIS OF SCIENTIFIC PUBLICATIONS DEVOTED TO SUSTAINABLE DEVELOPMENT GOAL 13 USING NLP TECHNIQUESen_US
dc.typeMaster's thesisen_US
workflow.import.sourcescience

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Thesis - Alpamys Pentayev.pdf
Size:
2.58 MB
Format:
Adobe Portable Document Format
Description:
Thesis
Loading...
Thumbnail Image
Name:
Presentation - Alpamys Pentayev.pptx
Size:
1.8 MB
Format:
Microsoft Powerpoint XML
Description:
Presentation