In this paper, an interactive multitask learning method for Chinese text sentiment classification is proposed. Here, the classic BiLSTM+attention+CRF model is used to obtain full use of the interaction relationship between tasks, and it simultaneously solves the two tasks of emotional dictionary expansion and sentiment classification. The proposed method divides text sentiment classification and emotional dictionary expansion into primary task and subtask, and it adopts the Enhanced Language Representation with Informative Entities (ERNIE) model as the text representation learning model for the primary task. Then, through the maximum pooling layer and the fully connected layer, the text sentiment classification task is completed. Meanwhile, the classical BiLSTM+attention+CRF model is used to extract emotional words from the text in the subtask. In addition, the multitask information interaction mechanism is used, and the prediction information on the autonomous subtask is fed back into the potential representation of the two tasks. After iterative training, the performance of the two tasks is further optimized. Micro-blogs with COVID-19 are used here as the subject to form the experimental data set. The results demonstrate the superiority of the proposed method over other approaches, and they further verify the superiority of ERNIE over BERT, RoBERTa and XLNet for the sentiment classification of Chinese text.
In order to obtain high quality and large-scale labelled data for information security research, we propose a new approach that combines a generative adversarial network with the BiLSTM-Attention-CRF model to obtain labelled data from crowd annotations. We use the generative adversarial network to find common features in crowd annotations and then consider them in conjunction with the domain dictionary feature and sentence dependency feature as additional features to be introduced into the BiLSTM-Attention-CRF model, which is then used to carry out named entity recognition in crowdsourcing. Finally, we create a dataset to evaluate our models using information security data. The experimental results show that our model has better performance than the other baseline models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.