A text mining and topic modeling based bibliometric exploration of information science research

Tipawan Silwattananusarn, Pachisa Kulkanjanapiban


This study investigates the evolution of information science research based on bibliometric analysis and semantic mining. The study discusses the value and application of metadata tagging and topic modeling. Forty-two thousand seven hundred thirty-eight articles were extracted from Clarivate Analytic's Web of Science Core Collection 2010-2020. This study was divided into two phases. Firstly, bibliometric analyzes were performed with VOSviewer. Secondly, the topic identification and evolution trends of information science research were conducted through the topic modeling approach latent dirichlet allocation (LDA) is often used to extract themes from a corpus, and the topic model was a representation of a collection of documents that is simplified using topic-modeling-toolkit (TMT). The top 10 core topics (tags) were information research design, information health-based, model data public, study information studies, analysis effect implications, knowledge support web, data research, social research study, study media information, and research impact time for the studied period. Not only does topic modeling assist in identifying popular topics or related areas within a researcher's area, but it may be used to discover emerging topics or areas of study throughout time.


bibliometrics; latent dirichlet allocation; research trends; text mining; topic modeling;

Full Text:


DOI: http://doi.org/10.11591/ijai.v11.i3.pp1057-1065


  • There are currently no refbacks.

View IJAI Stats

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.