Chinese paper classification based on pre-trained language model and hybrid deep learning method

Xin Luo; Sofianita Mutalib; Syaripah Ruzaini Syed Aris

doi:10.11591/ijai.v14.i1.pp641-649

Chinese paper classification based on pre-trained language model and hybrid deep learning method

Xin Luo, Sofianita Mutalib, Syaripah Ruzaini Syed Aris

Abstract

With the explosive growth in the number of published papers, researchers must filter papers by category to improve retrieval efficiency. The features of data can be learned through complex network structures of deep learning models without the need for manual definition and extraction in advance, resulting in better processing performance for large datasets. In our study, the pre-trained language model bidirectional encoder representations from transformers (BERT) and other deep learning models were applied to paper classification. A large-scale chinese scientific literature dataset was used, including abstracts, keywords, titles, disciplines, and categories from 396 k papers. Currently, there is little in-depth research on the role of titles, abstracts, and keywords in classification and how they are used in combination. To address this issue, we evaluated classification results by employing different title, abstract, and keywords concatenation methods to generate model input data, and compared the effects of a single sentence or sentence pair data input methods. We also adopted an ensemble learning approach to integrate the results of models that processed titles, keywords, and abstracts independently to find the best combination. Finally, we studied the combination of different types of models, such as the combination of BERT and convolutional neural networks (CNN), and measured the performance by accuracy, weighted average precision, weighted average recall, and weighted average F1 score.

Keywords

Bidirectional encoder representations from transformers; Chinese scientific literature dataset; Deep learning model; Model combination; Paper classification; Pre-training language model

Full Text:

PDF

DOI: http://doi.org/10.11591/ijai.v14.i1.pp641-649

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).

View IJAI Stats

Username
Password
Remember me