2019
DOI: 10.3389/fmicb.2019.00834
|View full text |Cite
|
Sign up to set email alerts
|

Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size

Abstract: For more than a decade, pan-genome analysis has been applied as an effective method for explaining the genetic contents variation of prokaryotic species. However, genomic characteristics and detailed structures of gene pools have not been fully clarified, because most studies have used a small number of genomes. Here, we constructed pan-genomes of seven species in order to elucidate variations in the genetic contents of >27,000 genomes belonging to Streptococcus pneumoniae , … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

5
49
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 57 publications
(54 citation statements)
references
References 28 publications
5
49
0
Order By: Relevance
“…It is notable that many accessory genes were unique to a single strain (2797 genes, representing 34.1% of the pan-genome). The large accessory genome in our results is not uncommon and has been reported in different bacterial pathogens, including S. aureus 17,18 and Staphylococcus haemolyticus 19 . However, we recognize that pangenome sizes and precise characterization of core and accessory genes are greatly influenced by the phylogenetic distance, sequence quality of genomes, the number of genomes being compared, assembly and annotation methods, and choice of threshold values to define orthologous genes.…”
Section: Resultssupporting
confidence: 68%
“…It is notable that many accessory genes were unique to a single strain (2797 genes, representing 34.1% of the pan-genome). The large accessory genome in our results is not uncommon and has been reported in different bacterial pathogens, including S. aureus 17,18 and Staphylococcus haemolyticus 19 . However, we recognize that pangenome sizes and precise characterization of core and accessory genes are greatly influenced by the phylogenetic distance, sequence quality of genomes, the number of genomes being compared, assembly and annotation methods, and choice of threshold values to define orthologous genes.…”
Section: Resultssupporting
confidence: 68%
“…Although our initial results revealed more open pangenome ( alpha ) in Clade B (0.762) than Clade A (0.807) or Clade C (0.806) or the whole collection (0.823), alpha was higher for small (<250) sample sizes here (Fig. 6) as indicated before 41 . Alpha estimates averaged across the sample size placed Clade C as more open than Clade A or Clade B (Table 3), highlighting a partial dependence of alpha on sample size that was removed once the sample sizes >250 when the relative rate of new genes became constant (Supplementary Fig.…”
Section: Resultssupporting
confidence: 49%
“…Previous work on E. coli 41 employed a metric of pangenome openness ( alpha ) that was similarly applied to our Roary pangenome results here to compare with previous findings that n = 648 Clade C isolates had a marginally more open genome (smaller alpha ) than n = 140 from Clade B, and both were more open than n = 70 Clade A isolates 31 . Although our initial results revealed more open pangenome ( alpha ) in Clade B (0.762) than Clade A (0.807) or Clade C (0.806) or the whole collection (0.823), alpha was higher for small (<250) sample sizes here (Fig.…”
Section: Resultsmentioning
confidence: 95%
See 1 more Smart Citation
“…The pan-genome analysis showed that our ESBL-EC isolates had an expanding pan-genome and this result is consistent with those of previous studies in pan-genome analyses of E. coli [70,71]. An open pan-genomic trait of E. coli probably contributes to the diversity of species [39].…”
Section: Discussionsupporting
confidence: 91%