Based on complex adaptive system theory and information theory for investigating heterogeneous situations, this paper develops an outlier knowledge management framework based on three aspects—dimension, object, and situation—for dealing with extreme public health events. In the context of the COVID-19 pandemic, we apply advanced natural language processing (NLP) technology to conduct data mining and feature extraction on the microblog data from the Wuhan area and the imported case province (Henan Province) during the high and median operating periods of the epidemic. Our experiment indicates that the semantic and sentiment vocabulary of words, the sentiment curve, and the portrait of patients seeking help were all heterogeneous in the context of COVID-19. We extract and acquire the outlier knowledge of COVID-19 and incorporate it into the outlier knowledge base of extreme public health events for knowledge sharing and transformation. The knowledge base serves as a think tank for public opinion guidance and platform suggestions for dealing with extreme public health events. This paper provides novel ideas and methods for outlier knowledge management in healthcare contexts.
This paper investigates the soybean futures price prediction problem from a new perspective and proposes an effective prediction model named Two‐Stage Hybrid Long Short‐Term Memory (TSH‐LSTM) by using text data from social media. First, the unstructured text is transformed into structured data by sentiment analysis and text classification methods. The improved sentiment score is computed by combining the degree centrality of sentiment words based on the sentiment dictionary method, and the characteristics of price fluctuations in texts are learned through the text Recurrent Convolutional Neural Networks. Second, the significant relationship between social media features and soybean futures price is assessed through stepwise regression, and the results of such an assessment are used as a basis for the identification of significant factors as input variables of the prediction model. Finally, the TSH‐LSTM prediction model is designed, and the final prediction result is acquired through the combination of prediction results of each stage using the error reciprocal method. The empirical results indicate that the incorporation of the social media text feature helps improve forecasting performances. Specifically, the proposed TSH‐LSTM is more accurate than univariate LSTM, multivariate LSTM, and eXtreme Gradient Boosting.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.