On the reconstruction of the ancestral bacterial genomes in genus Mycobacterium and Brucella

Guyeux, Christophe; Al-Nuaimi, Bashar; AlKindy, Bassam; Couchot, Jean-François; Salomon, Michel

doi:10.1186/s12918-018-0618-2

Cited by 5 publications

(6 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this article, we present a new general procedure termed ‘CRISPRbuilder-TB’ to reconstruct Mycobacterium tuberculosis CRISPR-Cas loci from short reads under a semi-automatized process. CRISPRbuilder-TB is inspired by previous developments [ 24 , 25 ] and particularizes the De Bruijn approach to the specific case of reconstructing CRISPR loci based on SRA, with the condition that its standard DR sequence is known. Applied on TB SRA without any genome assembly step, CRISPRbuilder-TB proved reliable and robust with reads of more than 75bp.…”

Section: Introductionmentioning

confidence: 99%

CRISPRbuilder-TB: “CRISPR-builder for tuberculosis”. Exhaustive reconstruction of the CRISPR locus in mycobacterium tuberculosis complex using SRA

et al. 2021

Self Cite

View full text Add to dashboard Cite

Mycobacterium tuberculosis complex (MTC) CRISPR locus diversity has long been studied solely investigating the presence/absence of a known set of spacers. Unveiling the genetic mechanisms of its evolution requires a more exhaustive reconstruction in a large amount of representative strains. In this article, we point out and resolve, with a new pipeline, the problem of CRISPR reconstruction based directly on short read sequences in M. tuberculosis. We first show that the process we set up, that we coin as “CRISPRbuilder-TB” (https://github.com/cguyeux/CRISPRbuilder-TB), allows an efficient reconstruction of simulated or real CRISPRs, even when including complex evolutionary steps like the insertions of mobile elements. Compared to more generalist tools, the whole process is much more precise and robust, and requires only minimal manual investigation. Second, we show that more than 1/3 of the currently complete genomes available for this complex in the public databases contain largely erroneous CRISPR loci. Third, we highlight how both the classical experimental in vitro approach and the basic in silico spoligotyping provided by existing analytic tools miss a whole diversity of this locus in MTC, by not capturing duplications, spacer and direct repeats variants, and IS6110 insertion locations. This description is extended in a second article that describes MTC-CRISPR diversity and suggests general rules for its evolution. This work opens perspectives for an in-depth exploration of M. tuberculosis CRISPR loci diversity and of mechanisms involved in its evolution and its functionality, as well as its adaptation to other CRISPR locus-harboring bacterial species.

show abstract

Section: Introductionmentioning

confidence: 99%

CRISPRbuilder-TB: “CRISPR-builder for tuberculosis”. Exhaustive reconstruction of the CRISPR locus in mycobacterium tuberculosis complex using SRA

et al. 2021

Self Cite

View full text Add to dashboard Cite

show abstract

“…It is reliable and robust provided that the reads have sufficient coverage and sizes long enough to span more than one DR. This tool, based on the analyses carried out in [17, 18] particularizes the De Bruijn approach to the specific case of the CRISPR locus and is the main contribution of this article. We show its usefulness both by showing that it can reconstruct CRISPR of reliable reference genomes, and by presenting that mean quality of CRISPR-Cas reconstruction is poor in other assembled genomes available in the public databases.…”

Section: Introductionmentioning

confidence: 99%

Exhaustive reconstruction of the CRISPR locus inMycobacterium tuberculosiscomplex using short reads

Guyeux

Sola

Refrégier

2019

Preprint

Self Cite

View full text Add to dashboard Cite

Spoligotyping, a graphical partial display of the CRISPR locus that can be produced in vitro or in silico, is an important tool for analyzing the diversity of given Mycobacterium tuberculosis complex (MTC) isolates. As other CRISPR loci, this locus is made up of an alternation between direct repeats and spacers, and flanked by cas genes. Unveiling the genetic mechanisms of its evolution requires to have a fairly large amount of fully reconstructed loci among all MTC lineages.In this article, we point out and resolve the problem of CRISPR reconstruction based on short read sequences. We first show that more than 1/3 of the currently assembled genomes available for this complex contain a CRISPR locus erroneously reconstructed, and errors can be very significant. Second, we present a new computational method allowing this locus to be reconstructed extensively and reliably in silico using short read sequencing runs. Third, using this method, we describe new structural characteristics of CRISPR locus by lineages. We show how both the classical experimental in vitro approach and the basic in silico spoligotyping provided by existing analytic tools miss a whole diversity of this locus in MTC, by not capturing duplications, spacer and direct repeats variants, and IS6110 insertion locations. This description is extended in a second article that presents general rules for the evolution of the CRISPR locus in MTC.This work opens new perspectives for a larger exploration of CRISPR loci diversity and of mechanisms involved in its evolution and its functionality.

show abstract

“…We also find, in the upper right part, lineages 2 to 4, and in the lower left part, lineages 1, 5, 6 and animal, and we know that these two subgroups are phylogenetically separated. The clusters of lineages 1 to 4 extend to the center of the cloud, arguing for a common origin of the tuberculosis complex, whose common ancestor could be M. canettii [ 20 ].…”

Section: Spolmap: Enriching the Visualization Of Crispr Diversitymentioning

confidence: 99%

Investigating the Diversity of Tuberculosis Spoligotypes with Dimensionality Reduction and Graph Theory

Senelle

Guyeux

Refrégier

et al. 2022

Genes

Self Cite

View full text Add to dashboard Cite

The spoligotype is a graphical description of the CRISPR locus present in Mycobacterium tuberculosis, which has the particularity of having only 68 possible spacers. This spoligotype, which can be easily obtained either in vitro or in silico, allows to have a summary information of lineage or even antibiotic resistance (when known to be associated to a particular cluster) at a lower cost. The objective of this article is to show that this representation is richer than it seems, and that it is under-exploited until now. We first recall an original way to represent these spoligotypes as points in the plane, allowing to highlight possible sub-lineages, particularities in the animal strains, etc. This graphical representation shows clusters and a skeleton in the form of a graph, which led us to see these spoligotypes as vertices of an unconnected directed graph. In this paper, we therefore propose to exploit in detail the description of the variety of spoligotypes using a graph, and we show to what extent such a description can be informative.

show abstract

On the reconstruction of the ancestral bacterial genomes in genus Mycobacterium and Brucella

Cited by 5 publications

References 40 publications

CRISPRbuilder-TB: “CRISPR-builder for tuberculosis”. Exhaustive reconstruction of the CRISPR locus in mycobacterium tuberculosis complex using SRA

CRISPRbuilder-TB: “CRISPR-builder for tuberculosis”. Exhaustive reconstruction of the CRISPR locus in mycobacterium tuberculosis complex using SRA

Exhaustive reconstruction of the CRISPR locus inMycobacterium tuberculosiscomplex using short reads

Investigating the Diversity of Tuberculosis Spoligotypes with Dimensionality Reduction and Graph Theory

Contact Info

Product

Resources

About