2021
DOI: 10.1155/2021/9969751
|View full text |Cite
|
Sign up to set email alerts
|

Genomic Island Prediction via Chi-Square Test and Random Forest Algorithm

Abstract: Genomic islands are related to microbial adaptation and carry different genomic characteristics from the host. Therefore, many methods have been proposed to detect genomic islands from the rest of the genome by evaluating its sequence composition. Many sequence features have been proposed, but many of them have not been applied to the identification of genomic islands. In this paper, we present a scheme to predict genomic islands using the chi-square test and random forest algorithm. We extract seven kinds of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 34 publications
(17 citation statements)
references
References 48 publications
0
13
0
Order By: Relevance
“…Here, two classic base classifiers, namely, SVM ( Cortes and Vapnik, 1995 ) and RF ( Breiman, 2001 ), were used, which were widely applied in tackling many biological problems ( Kandaswamy et al, 2011 ; Nguyen et al, 2015 ; Chen et al, 2017 ; Zhou JP. et al, 2020 ; Zhou J.-P. et al, 2020 ; Liang et al, 2020 ; Liu et al, 2021 ; Onesime et al, 2021 ; Wang et al, 2021 ; Zhu et al, 2021 ; Chen et al, 2022 ; Ding et al, 2022 ; Li et al, 2022 ; Wu and Chen, 2022 ).…”
Section: Methodsmentioning
confidence: 99%
“…Here, two classic base classifiers, namely, SVM ( Cortes and Vapnik, 1995 ) and RF ( Breiman, 2001 ), were used, which were widely applied in tackling many biological problems ( Kandaswamy et al, 2011 ; Nguyen et al, 2015 ; Chen et al, 2017 ; Zhou JP. et al, 2020 ; Zhou J.-P. et al, 2020 ; Liang et al, 2020 ; Liu et al, 2021 ; Onesime et al, 2021 ; Wang et al, 2021 ; Zhu et al, 2021 ; Chen et al, 2022 ; Ding et al, 2022 ; Li et al, 2022 ; Wu and Chen, 2022 ).…”
Section: Methodsmentioning
confidence: 99%
“…RF integrates the predictions of all decision trees with majority voting. RF is deemed as a powerful classification algorithm and has wide applications in tackling many biological problems ( Kandaswamy et al, 2011 ; Casanova et al, 2014 ; Marques et al, 2016 ; Jia et al, 2020 ; Liang et al, 2020 ; Zhang et al, 2021b ; Chen et al, 2021 ; Onesime et al, 2021 ; Chen et al, 2022 ; Ding et al, 2022 ; Yang and Chen, 2022 ).…”
Section: Methodsmentioning
confidence: 99%
“…In order to further study the arrangement of different structural elements, some important structural fragments or patterns have been proposed one after another [ 51 ]. The length of the longest segment (MaxSeg SE ) is defined as the following formula: where MaxLen represents the function of the maximal segment length, and SEG SE is the composed of each segment of the structure element SE [ 52 ].…”
Section: Methodsmentioning
confidence: 99%
“…In order to further study the arrangement of different structural elements, some important structural fragments or patterns have been proposed one after another [51]. The length of the longest segment ðMaxSeg SE Þ is defined as the following formula:…”
Section: Evolutionary Profilementioning
confidence: 99%