“…Therefore, two options needed to be investigated: Balance the data (using the ubBalance package in R), impute the missing values (using the missForest package in R), and then apply the C5.0 DT classifier for feature reduction Apply the C5.0 DT classifier for feature reduction with the missing values present in BORN and PRAMS, and then carry out the preprocessing steps (balance the data and fill in missing values)The results are summarized in the next chapter. The first step in creating the C5.0 DT classifiers was to modify a file called "mortality.names" this file contained information about the features and classes (files are labeled as mortality/nonmortality throughout this research, due to past work done by Hasmik on the ANN Builder[13], her work was focused on neonatal mortality risk estimation models using Artificial Neural Networks) In the mortality.names file the OUTCOME feature represented the target attribute, the CASE ID was the label attribute and the rest of the features in: PRAMS_Parous, PRAMS_Nulliparous, BORN_Parous or BORN_Nulliparous were defined to be continuous (numeric) or discrete (nominal). The next step was to create two csv files, one labeled mortality.csv and the other labeled nonmortality.csv.…”