Hongwu Qin scite author profile

Learning models used for prediction purposes are mostly developed without paying much cognizance to the size of datasets that can produce models of high accuracy and better generalization. Although, the general believe is that, large dataset is needed to construct a predictive learning model. To describe a data set as large in size, perhaps, is circumstance dependent, thus, what constitutes a dataset to be considered as being big or small is vague. In this paper, the ability of the predictive model to generalize with respect to a particular size of data when simulated with new untrained input is examined. The study experiments on three different sizes of data using Matlab program to create predictive models with a view to establishing if the size of data has any effect on the accuracy of a model. The simulated output of each model is measured using the Mean Absolute Error (MAE) and comparisons are made. Findings from this study reveals that, the quantity of data partitioned for the purpose of training must be of good representation of the entire sets and sufficient enough to span through the input space. The results of simulating the three network models also shows that, the learning model with the largest size of training sets appears to be the most accurate and consistently delivers a much better and stable results.

show abstract

A new efficient normal parameter reduction algorithm of soft sets

Sulaiman

Qin

et al. 2011

Computers & Mathematics with Applications

View full text Add to dashboard Cite

The Parameter Reduction of the Interval-Valued Fuzzy Soft Sets and Its Related Algorithms

Qin

Sulaiman

et al. 2014

IEEE Trans. Fuzzy Syst.

View full text Add to dashboard Cite

DFIS: A novel data filling approach for an incomplete soft set

Qin¹,

Ma²,

Herawan³

et al. 2012

View full text Add to dashboard Cite

The research on incomplete soft sets is an integral part of the research on soft sets and has been initiated recently. However, the existing approach for dealing with incomplete soft sets is only applicable to decision making and has low forecasting accuracy. In order to solve these problems, in this paper we propose a novel data filling approach for incomplete soft sets. The missing data are filled in terms of the association degree between the parameters when a stronger association exists between the parameters or in terms of the distribution of other available objects when no stronger association exists between the parameters. Data filling converts an incomplete soft set into a complete soft set, which makes the soft set applicable not only to decision making but also to other areas. The comparison results elaborated between the two approaches through UCI benchmark datasets illustrate that our approach outperforms the existing one with respect to the forecasting accuracy.

show abstract

Data Filling Approach of Soft Sets under Incomplete Information

Qin

Herawan

et al. 2011

View full text Add to dashboard Cite

Neural Networks Optimization through Genetic Algorithm Searches: A Review

Chiroma¹,

Noor²,

Kareem³

et al. 2017

Appl. Math. Inf. Sci

View full text Add to dashboard Cite

A novel soft set approach in selecting clustering attribute

Qin

Zain

et al. 2012

Knowledge-Based Systems

View full text Add to dashboard Cite

A novel liquidchip platform for simultaneous detection of 70 alleles of DNA somatic mutations on EGFR, KRAS, BRAF and PIK3CA from formalin-fixed and paraffin-embedded slides containing tumor tissue

Li¹,

Luo²,

He³

et al. 2010

Clinical Chemistry and Laboratory Medicine

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hongwu Qin

Evaluating the Effect of Dataset Size on Predictive Model Using Supervised Learning Technique

A new efficient normal parameter reduction algorithm of soft sets

The Parameter Reduction of the Interval-Valued Fuzzy Soft Sets and Its Related Algorithms

DFIS: A novel data filling approach for an incomplete soft set

Data Filling Approach of Soft Sets under Incomplete Information

Neural Networks Optimization through Genetic Algorithm Searches: A Review

A novel soft set approach in selecting clustering attribute

A novel liquidchip platform for simultaneous detection of 70 alleles of DNA somatic mutations on EGFR, KRAS, BRAF and PIK3CA from formalin-fixed and paraffin-embedded slides containing tumor tissue

Contact Info

Product

Resources

About