PARALLEL NEWS CLUSTERING AND TOPIC MODELING APPROACHES

dc.contributor.authorShomanov, A S
dc.contributor.authorMansurova, M E
dc.date.accessioned2021-09-16T07:30:38Z
dc.date.available2021-09-16T07:30:38Z
dc.date.issued2021
dc.description.abstractAt the current age there is an urgent need in developing massively scalable and efficient tools to Big Data processing. Even the smallest companies nowadays inevitably require more and more resources for data processing routines that could enhance decision making and reliably predict and simulate different scenarios. In the current paper we present our combined work on different massively scalable approaches for the task of clustering and topic modeling of the dataset, collected by crawling Kazakhstan news websites. In particular, we propose Apache Spark parallel solutions to news clustering and topic modeling problems and, additionally, we describe results of implementing document clustering using developed partitioned global address space Mapreduce system. In our work we describe our experience in solving these problems and investigate the efficiency and scalability of the proposed solutions.en_US
dc.identifier.citationShomanov, A. S., & Mansurova, M. E. (2021). Parallel news clustering and topic modeling approaches. Journal of Physics: Conference Series, 1727, 012018. https://doi.org/10.1088/1742-6596/1727/1/012018en_US
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/5793
dc.language.isoenen_US
dc.publisherJournal of Physics: Conference Seriesen_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjectType of access: Open Accessen_US
dc.subjectBig Data processingen_US
dc.titlePARALLEL NEWS CLUSTERING AND TOPIC MODELING APPROACHESen_US
dc.typeArticleen_US
workflow.import.sourcescience

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Shomanov_2021_J._Phys.__Conf._Ser._1727_012018.pdf
Size:
627.02 KB
Format:
Adobe Portable Document Format
Description:
Article

Collections