2015
DOI: 10.1093/bioinformatics/btv421
|View full text |Cite
|
Sign up to set email alerts
|

Roary: rapid large-scale prokaryote pan genome analysis

Abstract: Summary: A typical prokaryote population sequencing study can now consist of hundreds or thousands of isolates. Interrogating these datasets can provide detailed insights into the genetic structure of prokaryotic genomes. We introduce Roary, a tool that rapidly builds large-scale pan genomes, identifying the core and accessory genes. Roary makes construction of the pan genome of thousands of prokaryote samples possible on a standard desktop without compromising on the accuracy of results. Using a single CPU Ro… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
3,101
0
6

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 3,813 publications
(3,114 citation statements)
references
References 9 publications
(12 reference statements)
5
3,101
0
6
Order By: Relevance
“…We generated alignments by mapping reads to the TIGR4 reference using bwa-mem v0.7.10 with default settings 21 , and called variants from these alignments using samtools v1.2 mpileup and bcftools call 22 . We used Roary 1.007001 23 with a 95% BLAST ID cutoff to construct a pan-genome from the annotated assemblies, from which a core gene alignment was extracted. We then created alignments by two further methods.…”
Section: Methodsmentioning
confidence: 99%
“…We generated alignments by mapping reads to the TIGR4 reference using bwa-mem v0.7.10 with default settings 21 , and called variants from these alignments using samtools v1.2 mpileup and bcftools call 22 . We used Roary 1.007001 23 with a 95% BLAST ID cutoff to construct a pan-genome from the annotated assemblies, from which a core gene alignment was extracted. We then created alignments by two further methods.…”
Section: Methodsmentioning
confidence: 99%
“…Patterns of gene loss or gain across strains can be inferred from these inventories. Toward this end, we annotated each genome assembly using Prokka (Seemann 2014) and used Roary (Page et al 2015) to cluster orthologous coding sequences and generate a gene possession matrix (assemblies with more than 500 contigs were excluded). This identified 2148 orthologous clusters present in >99% of 209 strains (core genes), along with an additional 36,627 present in only a subset of strains (Table 4; Supplemental Text S3; Supplemental Fig.…”
Section: Pan-genome Analyses Identified Clade-specific Gene Content Imentioning
confidence: 99%
“…By using the Roary gene presence matrix (homologous clusters) (Page et al 2015), we found 8409 genes lost from at least one longitudinal series (Fig. 6C).…”
Section: Recurrent Genomic Changes In B Cenocepacia During Long-termmentioning
confidence: 99%
See 2 more Smart Citations