“…Thissectionprovidesagenericoverviewofthehierarchicalensemblemethodologyforsolvingthe multi-classclassificationproblem.Thehierarchicalensemblemethodologyisarelativelyrecently proposedapproachtoaddressthemulti-classclassificationproblemwhichinvolvesthegeneration ofahierarchical"meta-algorithm" (Kumaretal.,2002;Madzarovetal.,2008).Acommonstructure adoptedforhierarchicalclassification,asnotedintheprevioussection,isabinarytreestructure constructed in either a bottom-up or top-down manner (Beygelzimer et al, 2007;Kumar et al, 2002).Inthetop-downapproach,therootnodecontainsthecompletesetofclasslabels{c 1 , c 2 , …, c n }.Startingfromtheroot,thesetofclasslabelsateachnodeisrecursivelysplit,andaclassifieris trainedtodistinguishbetweenthetwosubsets.Usingthebottom-upapproachamergingprocessis adoptedsimilartoagglomerativehierarchicalclustering.Thetwonodeswiththeclosestdistanceare mergedtoformanodedescribinganewmeta-class (Beygelzimeretal.,2007).Anexamplebinary treehierarchyispresentedinFigure1. Attheroot,wediscriminatebetweentwogroupsofclass labels{a, b, c}and{d, e}.Atthenextlevel,wedistinguishbetweensmallergroups,andsoon,till wereachnodeswithclassifiersthatcanassignasingleclasslabeltoagivenrecord. When considering hierarchical classification models the necessary class partitioning can be conductedusingavarietyofmethodssuchasdatasplittingorclustering.Theperformanceofthe binarytreeapproach,themostcommonlyusedhierarchicalensemblemodel,issignificantlyinfluenced by the adopted class partitioning method; inappropriate choices can result in poor performance (Alshdaifat,Coenen,&Dures,2013a,2013b,2014.Otherthanthenatureofthegroupingmethod tobeadopted,aseconddrawbackofthebinarytreebasedhierarchicalensemblemodelisthatifa recordismisclassifiedearlyonintheclassificationprocess(neartherootofthehierarchy)itwill continuetobemisclassifiedatdeeperlevels;thesocalled"successivemisclassification"problem. Inpreviousworktheauthorshavesuggestedamultiple-pathstrategy,whichallowsformorethan onepathtobefollowedwithinthebinarytreeduringtheclassificationstage.Thismultiple-path strategywasfacilitatedbytheuseofclassifiers,suchasNaiveBayesorClassificationAssociation RuleMining(CARM),whichfeatureprobabilityorconfidencevaluesthatcanbeusedtodetermine whereonepathshouldbefollowedandwheretwopathsshouldbefollowed.However,themulti-path strategyonlypartiallyresolvesthesuccessivemisclassificationproblem,fundamentallythebinary treestructureisnotsufficientlyexpressivetocapturethenatureofmulti-classclassification.…”