“…These methods extract various features from amino acid residues and use them as the input to train the machine-learning models for classification. Several algorithms like SVM (Cai et al, 2003;Kumar et al, 2008;Murakami et al, 2010;Wang et al, 2010;Walia et al, 2014;Yang et al, 2015a;Bressin et al, 2019;Su et al, 2019;Qiu et al, 2020), neural networks (Alipanahi et al, 2015;Peng et al, 2017;Yan and Kurgan, 2017;Deng et al, 2018;Zhao and Du, 2020;Zhang et al, 2021b;Sun et al, 2021;Zhang et al, 2022), naive bayes classifier (Sharan et al, 2017;Deng et al, 2021; etc, as listed in Supplementary Table S1, have been successfully implemented. A common limitation faced by such ML-based methods is that the extracted features may be poorly representative of the physicochemical and environmental properties of amino acid residues, or their simplistic combination may introduce redundancy and affect overall prediction power of the approaches.…”