2019
DOI: 10.1108/imds-04-2019-0254
|View full text |Cite
|
Sign up to set email alerts
|

Inducing stock market lexicons from disparate Chinese texts

Abstract: Purpose The purpose of this paper is to propose a methodology to construct a stock market sentiment lexicon by incorporating domain-specific knowledge extracted from diverse Chinese media outlets. Design/methodology/approach This paper presents a novel method to automatically generate financial lexicons using a unique data set that comprises news articles, analyst reports and social media. Specifically, a novel method based on keyword extraction is used to build a high-quality seed lexicon and an ensemble me… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 47 publications
(64 reference statements)
1
4
0
Order By: Relevance
“…In addition, another interesting finding is that for any chosen classifier, the performance of predicting whether a tweet will be retweeted or not is relatively better than that of predicting retweet volume in terms of all result indicators. This result echoes the findings of Zhao et al . (2020).…”
Section: Resultssupporting
confidence: 92%
See 3 more Smart Citations
“…In addition, another interesting finding is that for any chosen classifier, the performance of predicting whether a tweet will be retweeted or not is relatively better than that of predicting retweet volume in terms of all result indicators. This result echoes the findings of Zhao et al . (2020).…”
Section: Resultssupporting
confidence: 92%
“…Using the five sets of features mentioned above, RF shows the best prediction performance for the two different dependent variables, which is consistent with the results reported in previous studies on users' retweet prediction (Sharma and Gupta, 2022). Furthermore, the superiority of the random forest algorithm has also been verified in other fields, such as content donation prediction (Zhao et al ., 2020), online reviews helpfulness prediction (Lee et al ., 2018), crash injury severity prediction (Santos et al ., 2022) and even energy consumption prediction (Ding et al ., 2021). In addition, another interesting finding is that for any chosen classifier, the performance of predicting whether a tweet will be retweeted or not is relatively better than that of predicting retweet volume in terms of all result indicators.…”
Section: Resultsmentioning
confidence: 98%
See 2 more Smart Citations
“…Word2vec provides a modeling method that extracts feature vectors of words from their contexts to express deep semantic information about the words (Mikolov et al, 2013). Word2vec is also an effective tool to obtain semantic similarity (Zhao et al, 2020). Therefore, to better quantitatively analyze technology trends, we combine the LDA and Word2vec to capture technical information about potential topics in patent texts at the semantic level.…”
Section: Patent Semantic Analysismentioning
confidence: 99%