Inducing stock market lexicons from disparate Chinese texts

Zhao, Futao; Yao, Zhong; Luan, Jing; Liu, Hao

doi:10.1108/imds-04-2019-0254

Cited by 4 publications

(5 citation statements)

References 47 publications

(64 reference statements)


“…In addition, another interesting finding is that for any chosen classifier, the performance of predicting whether a tweet will be retweeted or not is relatively better than that of predicting retweet volume in terms of all result indicators. This result echoes the findings of Zhao et al. (2020).…”

Section: Results (supporting)

confidence: 92%

“…Using the five sets of features mentioned above, RF shows the best prediction performance for the two different dependent variables, which is consistent with the results reported in previous studies on users' retweet prediction (Sharma and Gupta, 2022). Furthermore, the superiority of the random forest algorithm has also been verified in other fields, such as content donation prediction (Zhao et al., 2020), online reviews helpfulness prediction (Lee et al., 2018), crash injury severity prediction (Santos et al., 2022) and even energy consumption prediction (Ding et al., 2021). In addition, another interesting finding is that for any chosen classifier, the performance of predicting whether a tweet will be retweeted or not is relatively better than that of predicting retweet volume in terms of all result indicators.…”

Section: Results (mentioning)

confidence: 98%

“…First, in terms of the author's basic features, the gender of an author was considered because prior research has shown the role of gender in users' donation behavior (Zhao et al., 2020). The gender in this work could be female, male, confidential and default null value.…”

Section: Methods (mentioning)

confidence: 99%

“…These features were extracted before data analysis, and are expected to influence users' decisions according to previous research on other topics, such as information persuasion (Fishbein and Ajzen, 1981), review helpfulness (Hong et al., 2017), and user engagement (Chen et al., 2020). More importantly, Jieba, a popular language analysis tool that focuses on Chinese sentence segmentation (Zhao et al., 2020), was utilized for word segmentation and part-of-speech (POS) tagging. Then the number of nouns, verbs, adjectives and modal particles were incorporated as features of POS tags.…”

Section: Methods (mentioning)

confidence: 99%

How to identify influential content: Predicting retweets in online financial community

Zhong, Zhao, et al. 2023

AJIM

Purpose: Retail investors are prone to be affected by information dissemination in social media with the rapid development of Web 2.0. The purpose of this study is to recognize the factors that may impact users' retweet behavior, namely information dissemination in the online financial community, through machine learning techniques.

Design/methodology/approach: This paper crawled data from the Chinese online financial community (Xueqiu.com) and extracted author-related, content-related, situation-related, stock-related and stock market-related features from the dataset. The best information dissemination prediction model based on these features was determined by evaluating five classifiers with various performance metrics, and the predictability of different feature groups was tested.

Findings: Five prevalent classifiers were evaluated with various performance metrics and the random forest classifier was proven to be the best retweet prediction model in the authors’ experiments. Moreover, the predictability of author-related, content-related and market-related features was illustrated to be relatively better than that of the other two feature groups. Several particularly important features, such as the author's followers and the rise and fall of the stock index, were recognized in this paper at last.

Originality/value: This study contributes to in-depth research on information dissemination in the financial domain. The findings of this study have important practical implications for government regulators to supervise public opinion in the financial market.

“…Word2vec provides a modeling method that extracts feature vectors of words from their contexts to express deep semantic information about the words (Mikolov et al., 2013). Word2vec is also an effective tool to obtain semantic similarity (Zhao et al., 2020). Therefore, to better quantitatively analyze technology trends, we combine the LDA and Word2vec to capture technical information about potential topics in patent texts at the semantic level.…”

Section: Patent Semantic Analysis (mentioning)

confidence: 99%

Tracing the technological trajectory of coal slurry pipeline transportation technology: An HMM-based topic modeling approach

Wang, Li, Feng 2022

Front. Energy Res.

Coal slurry pipeline transportation is an important way to realize green coal logistics. However, there are still challenges in understanding the cognitive aspects of the development trajectory of coal slurry pipeline transportation technology. This study attempts to trace and predict the technology trend from patent texts through stochastic process analysis of topic evolution. It helps clarify the challenges in the development of coal slurry pipeline transportation technology and captures the trends and development characteristics of the technology to improve research and development (R&D) efficiency and sustainability. The study first extracts potential technology topics from patent text using the Latent Dirichlet Allocation (LDA) method. Then, a Word2vec-based topic word vector model is applied to calculate the cosine similarity between topics. An HMM-based topic evolution trend model is then constructed by introducing the Hidden Markov Model (HMM), which can portray a dual stochastic process, and is used to analyze and predict trends in the technological evolution of this field. It was found that the advancement of technology related to pulping is fundamental to promoting the development of coal slurry pipeline transportation technology, which is also a common research topic. Technologies related to pipeline transportation capacity enhancement and the industrial application of coal slurry will be the focus of future R&D in this field, with broad research and application prospects. This study is intended to provide directions for sustainable R&D activities in coal slurry pipeline transportation technology, facilitate interdisciplinary discussions, and provide objective data for future decision-making by scientists and R&D managers in this field.

The role of digital transformation practices in the operations improvement in manufacturing firms: A practice-based view

Tian, Chen, Tian, et al. 2023

International Journal of Production Economics
