Chongsheng Zhang scite author profile

Current benchmark reports of classification algorithms generally concern common classifiers and their variants but do not include many algorithms that have been introduced in recent years. Moreover, important properties such as the dependence on the number of classes and features and the CPU running time are typically not examined. In this paper, we carry out a comparative empirical study of both established classifiers and more recently proposed ones on 71 data sets originating from different domains, publicly available at the UCI and KEEL repositories. The list of 11 algorithms studied includes Extreme Learning Machine (ELM), Sparse Representation based Classification (SRC), and Deep Learning (DL), which have not been thoroughly investigated in existing comparative studies. It is found that Stochastic Gradient Boosting Trees (GBDT) matches or exceeds the prediction performance of Support Vector Machines (SVM) and Random Forests (RF), while being the fastest algorithm in terms of prediction efficiency. ELM also yields good accuracy results, ranking in the top 5 alongside GBDT, RF, SVM, and C4.5, but its performance varies widely across data sets. Unsurprisingly, the top accuracy performers have average or slow training times. DL is the worst performer in terms of accuracy but the second fastest in prediction efficiency. SRC shows good accuracy but is the slowest classifier in both training and testing.

An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme

Zhang

2018

Knowledge-Based Systems

162

Multi-Imbalance: An open-source software for multi-class imbalance learning

Zhang

et al. 2019

Knowledge-Based Systems

140

On Incremental Learning for Gradient Boosting Decision Trees

Zhang

Shi

et al. 2019

Neural Process Lett

Turning wingbeat sounds into spectrum images for acoustic insect classification

et al. 2017

A novel acoustic insect classifier based on deep convolutional features of frequency spectrum images generated from wingbeat sounds is introduced. By visualising insect wingbeat sounds, the proposed method is the first attempt to convert time-series acoustic signal processing into image recognition, which has recently improved significantly with convolutional neural networks. Experiments on the public UCR flying insect datasets show that the proposed method achieves better accuracy than state-of-the-art methods.
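
The first step described above, turning a one-dimensional sound signal into a spectrum image, can be illustrated with a short sketch. This is not the authors' pipeline: the frame length, hop size, Hann window, log scaling, and the function names `spectrogram` and `to_grayscale` are all assumptions chosen for illustration, and the convolutional network that would consume the image is left out.

```python
import cmath
import math


def spectrogram(signal, frame_len=64, hop=32):
    """Return one row of log-power values (in dB) per time frame.

    Assumed parameters: Hann window, naive DFT, non-negative frequencies only.
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1
    rows = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        row = []
        for k in range(n_bins):
            # Naive DFT coefficient for frequency bin k.
            coeff = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                        for n in range(frame_len))
            row.append(10.0 * math.log10(abs(coeff) ** 2 + 1e-12))
        rows.append(row)
    return rows


def to_grayscale(rows):
    """Rescale log-power values to integers in 0..255 (time x frequency)."""
    lo = min(min(r) for r in rows)
    hi = max(max(r) for r in rows)
    span = (hi - lo) or 1.0
    return [[int(255 * (v - lo) / span) for v in r] for r in rows]


if __name__ == "__main__":
    # Synthetic stand-in for a wingbeat recording: a pure 128 Hz tone.
    sample_rate = 1024
    tone = [math.sin(2 * math.pi * 128 * n / sample_rate) for n in range(512)]
    image = to_grayscale(spectrogram(tone))
    print(len(image), "frames x", len(image[0]), "frequency bins")
```

Each row of the resulting array is one time frame and each column one frequency bin, so the array can be saved or fed to an image classifier as a grayscale picture.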

Exploring Pattern Mining Algorithms for Hashtag Retrieval Problem

et al. 2020

The hashtag is an iconic feature for retrieving hot topics of discussion on Twitter and other social networks. This paper incorporates pattern mining approaches to improve the accuracy of retrieving relevant information and to speed up search. A novel algorithm called PM-HR (Pattern Mining for Hashtag Retrieval) first transforms the set of tweets into a transactional database using one of two strategies (trivial and temporal). The relevant patterns are then discovered and used as a knowledge-based system for finding the tweets relevant to users' queries through a similarity search process. Extensive experiments on large and varied tweet collections show that PM-HR outperforms the baseline hashtag retrieval approaches in terms of runtime and is very competitive in terms of accuracy. Index terms: hashtag retrieval, pattern mining, scalability.
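
The three stages named in the abstract (tweets to transactions, pattern discovery, similarity-based retrieval) can be sketched as follows. The abstract does not define the two transformation strategies, the mining algorithm, or the similarity measure, so everything here is an assumption made for illustration: one transaction per tweet, brute-force counting of itemsets up to size two, and Jaccard similarity between the query and each pattern. The function names are hypothetical and this is not the PM-HR implementation.

```python
from collections import Counter
from itertools import combinations


def to_transactions(tweets):
    """Assumed strategy: one transaction per tweet, items are lowercase tokens."""
    return [frozenset(tweet.lower().split()) for tweet in tweets]


def frequent_itemsets(transactions, min_support=2, max_size=2):
    """Brute-force count of itemsets up to max_size; keep those meeting min_support."""
    counts = Counter()
    for items in transactions:
        for size in range(1, max_size + 1):
            for combo in combinations(sorted(items), size):
                counts[frozenset(combo)] += 1
    return {pattern: n for pattern, n in counts.items() if n >= min_support}


def jaccard(a, b):
    """Jaccard similarity of two sets (assumed similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def retrieve(query, tweets, patterns, top_k=3):
    """Rank tweets by the best query-to-pattern similarity among patterns they contain."""
    q = frozenset(query.lower().split())
    scored = []
    for tweet, items in zip(tweets, to_transactions(tweets)):
        score = max((jaccard(q, p) for p in patterns if p <= items), default=0.0)
        if score > 0:
            scored.append((score, tweet))
    # Stable sort keeps the original tweet order among equal scores.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [tweet for _, tweet in scored[:top_k]]


if __name__ == "__main__":
    tweets = [
        "#ai beats chess champion",
        "new #ai model released",
        "#football final tonight",
        "#football fans celebrate",
    ]
    patterns = frequent_itemsets(to_transactions(tweets))
    print(retrieve("#ai news", tweets, patterns))
```

Mining the patterns once and matching queries against them, rather than against every raw tweet token, is the kind of design that could plausibly yield the runtime advantage the abstract reports, though the sketch makes no attempt to reproduce those results.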

Discovering Highly Informative Feature Sets from Data Streams

Zhang

Masséglia

2010

An empirical evaluation of high utility itemset mining algorithms

Zhang

Almpanidis

Wang

et al. 2018

Expert Systems with Applications
