2017
DOI: 10.7717/peerj.3219
|View full text |Cite
|
Sign up to set email alerts
|

CRF: detection of CRISPR arrays using random forest

Abstract: CRISPRs (clustered regularly interspaced short palindromic repeats) are particular repeat sequences found in wide range of bacteria and archaea genomes. Several tools are available for detecting CRISPR arrays in the genomes of both domains. Here we developed a new web-based CRISPR detection tool named CRF (CRISPR Finder by Random Forest). Different from other CRISPR detection tools, a random forest classifier was used in CRF to filter out invalid CRISPR arrays from all putative candidates and accordingly enhan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 11 publications
(15 citation statements)
references
References 26 publications
(37 reference statements)
0
15
0
Order By: Relevance
“…Several programs have been developed to identify CRISPR arrays in genomic sequences, the most frequently cited being CRISPRFinder ( 7 ), CRT ( 8 ) and PILER-CR ( 9 ). Additional programs such as CRISPRDetect ( 10 ), CRISPRdigger ( 11 ) and CRF ( 12 ) are also available. Three programs have been proposed for CRISPR array strand prediction based on the characteristics of the CRISPR repeat and the leader: CRISPRDirection using CRISPRDetect ( 10 , 13 ), CRISPRstrand ( 14 , 15 ) and CRISPRleader ( 16 ).…”
Section: Introductionmentioning
confidence: 99%
“…Several programs have been developed to identify CRISPR arrays in genomic sequences, the most frequently cited being CRISPRFinder ( 7 ), CRT ( 8 ) and PILER-CR ( 9 ). Additional programs such as CRISPRDetect ( 10 ), CRISPRdigger ( 11 ) and CRF ( 12 ) are also available. Three programs have been proposed for CRISPR array strand prediction based on the characteristics of the CRISPR repeat and the leader: CRISPRDirection using CRISPRDetect ( 10 , 13 ), CRISPRstrand ( 14 , 15 ) and CRISPRleader ( 16 ).…”
Section: Introductionmentioning
confidence: 99%
“…It searches each read for DR sequences matching query DRs specified by the user. These DRs can be selected from CRISPR arrays detected with genomic CRISPR detection tools such as PILER-CR ( Edgar, 2007 ), CRF ( Wang & Liang, 2017 ), or CRISPRFinder ( Grissa, Vergnaud & Pourcel, 2007b ) in fully assembled microbial genomes or assembled metagenomic contigs. The steps of the MetaCRAST pipeline are outlined in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…For instance, if users want to compare the CRISPR arrays from different species, CRISPRcompar ( 27 ) comprising CRISPRcomparison and CRISPRtionary and basically derived from CRISPRFinder must be the best choice. Besides, CRF ( 28 ) based on CRT added random forest algorithm to make an extra filtration for invalid CRISPR arrays, but this learning-based tool may partially lose the ability to discover new CRISPRs. Beyond that, three representative tools are designed for CRISPR strand prediction using the characteristics of leader and repeat that include CRISPRstrand ( 29 ), CRISPRleader ( 31 ), and CRISPRDirection ( 37 ).…”
Section: Crispr/cas System Identificationmentioning
confidence: 99%