“…A good prediction method must be combined with an effective feature extraction scheme to achieve better prediction results. At present, there are many popular feature extraction schemes, including amino acid composition (AAC) (Li and Wang, 2016;Meher et al, 2017;Chung et al, 2019;Lv et al, 2019a,b), pseudo amino acid composition (PseAAC) (Shen and Chou, 2008;Khosraviana et al, 2013;Hajisharifi et al, 2014;Zare et al, 2015), physicochemical properties (Melo et al, 2011;Shua et al, 2013;Agrawal et al, 2018;Bhadra et al, 2018;Chung et al, 2019;Schaduangrat et al, 2019;Lv et al, 2020a;Zhang et al, 2020), binary position map (Chung et al, 2019), position specific scoring matrix (PSSM) (An et al, 2019; FIGURE 1 | The overall framework of our classifier. Training data set from DS1 or seven training data sets from DS2 are computed separately through amino acid reduction, dipeptide feature extraction, supporting vector machine model training and 10-fold cross-validation model evaluation.…”