Hsinchun Chen scite author profile

РозділПрізвище, ініціали та посада консультанта Підпис, дата завдання видав завдання прийняв Охорона праці Дмитроца Л.П., к.т.н., доц. кафедри КН Безпека в надзвичайних ситуаціях Клепчик В.М., проректор з адміністративно-господарської роботи та будівництва 7. Дата видачі завдання 27 вересня 2021 року КАЛЕНДАРНИЙ ПЛАН № з/п Назва етапів роботи Термін виконання етапів роботи Примітка

show abstract

Credit rating analysis with support vector machines and neural networks: a market comparative study

Huang

Chen

Hsu

et al. 2004

Decision Support Systems

836

445

View full text Add to dashboard Cite

A framework for authorship identification of online messages: Writing‐style features and classification techniques

Zheng

Chen

et al. 2005

J. Am. Soc. Inf. Sci.

494

419

View full text Add to dashboard Cite

With the rapid proliferation of Internet technologies and applications, misuse of online messages for inappropriate or illegal purposes has become a major concern for society. The anonymous nature of online-message distribution makes identity tracing a critical problem. We developed a framework for authorship identification of online messages to address the identity-tracing problem. In this framework, four types of writing-style features (lexical, syntactic, structural, and content-specific features) are extracted and inductive learning algorithms are used to build feature-based classification models to identify authorship of online messages. To examine this framework, we conducted experiments on English and Chinese online-newsgroup messages. We compared the discriminating power of the four types of features and of three classification techniques: decision trees, backpropagation neural networks, and support vector machines. The experimental results showed that the proposed approach was able to identify authors of online messages with satisfactory accuracy of 70 to 95%. All four types of message features contributed to discriminating authors of online messages. Support vector machines outperformed the other two classification techniques in our experiments. The high performance we achieved for both the English and Chinese datasets showed the potential of this approach in a multiplelanguage context.

show abstract

The information content of mandatory risk factor disclosures in corporate filings

et al. 2013

View full text Add to dashboard Cite

Textual analysis of stock market prediction using breaking financial news

Schumaker

Chen

2009

ACM Trans. Inf. Syst.

618

359

View full text Add to dashboard Cite

Our research examines a predictive machine learning approach for financial news articles analysis using several different textual representations: bag of words, noun phrases, and named entities. Through this approach, we investigated 9,211 financial news articles and 10,259,042 stock quotes covering the S&P 500 stocks during a five week period. We applied our analysis to estimate a discrete stock price twenty minutes after a news article was released. Using a support vector machine (SVM) derivative specially tailored for discrete numeric prediction and models containing different stock-specific variables, we show that the model containing both article terms and stock price at the time of article release had the best performance in closeness to the actual future stock price (MSE 0.04261), the same direction of price movement as the future price (57.1% directional accuracy) and the highest return using a simulated trading engine (2.06% return). We further investigated the different textual representations and found that a Proper Noun scheme performs better than the de facto standard of Bag of Words in all three metrics.

show abstract

Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering

Huang

Chen

Zeng

2004

ACM Trans. Inf. Syst.

535

289

View full text Add to dashboard Cite

Recommender systems are being widely applied in many application settings to suggest products, services, and information items to potential consumers. Collaborative filtering, the most successful recommendation approach, makes recommendations based on past transactions and feedback from consumers sharing similar interests. A major problem limiting the usefulness of collaborative filtering is the sparsity problem, which refers to a situation in which transactional or feedback data is sparse and insufficient to identify similarities in consumer interests. In this article, we propose to deal with this sparsity problem by applying an associative retrieval framework and related spreading activation algorithms to explore transitive associations among consumers through their past transactions and feedback. Such transitive associations are a valuable source of information to help infer consumer interests and can be explored to deal with the sparsity problem. To evaluate the effectiveness of our approach, we have conducted an experimental study using a data set from an online bookstore. We experimented with three spreading activation algorithms including a constrained Leaky Capacitor algorithm, a branch-and-bound serial symbolic search algorithm, and a Hopfield net parallel relaxation search algorithm. These algorithms were compared with several collaborative filtering approaches that do not consider the transitive associations: a simple graph search approach, two variations of the user-based approach, and an item-based approach. Our experimental results indicate that spreading activation-based approaches significantly outperformed the other collaborative filtering methods as measured by recommendation precision, recall, the F-measure, and the rank score. We also observed the over-activation effect of the spreading activation approach, that is, incorporating transitive associations with past transactional data that is not sparse may "dilute" the data used to infer user preferences and lead to degradation in recommendation performance.

show abstract

The Information Content of Mandatory Risk Factor Disclosures in Corporate Filings

et al. 2010

View full text Add to dashboard Cite

Social Media Analytics and Intelligence

et al. 2010

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hsinchun Chen

Business Intelligence and Analytics: From Big Data to Big Impact

Credit rating analysis with support vector machines and neural networks: a market comparative study

A framework for authorship identification of online messages: Writing‐style features and classification techniques

The information content of mandatory risk factor disclosures in corporate filings

Textual analysis of stock market prediction using breaking financial news

Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering

The Information Content of Mandatory Risk Factor Disclosures in Corporate Filings

Social Media Analytics and Intelligence

Contact Info

Product

Resources

About