The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2016
DOI: 10.1093/gbe/evw083
|View full text |Cite
|
Sign up to set email alerts
|

Breaking Good: Accounting for Fragility of Genomic Regions in Rearrangement Distance Estimation

Abstract: Models of evolution by genome rearrangements are prone to two types of flaws: One is to ignore the diversity of susceptibility to breakage across genomic regions, and the other is to suppose that susceptibility values are given. Without necessarily supposing their precise localization, we call “solid” the regions that are improbably broken by rearrangements and “fragile” the regions outside solid ones. We propose a model of evolution by inversions where breakage probabilities vary across fragile regions and ov… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
30
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 43 publications
(39 citation statements)
references
References 52 publications
1
30
0
Order By: Relevance
“…Simulations with Zombi are fast: with a starting genome of 500 genes and a species tree of 2000 taxa (extinct + extant), it takes around 1 minute on a 3.4Ghz laptop to simulate all the genomes ( Figure S6). We validated that the distribution of waiting times between successive events was following an exponential distribution ( Figure S7 and S8), that the distribution of intergene sizes at equilibrium was following a flat Dirichlet distribution, as expected from Biller et al 2016 ( Figure S9), that the number of events and their extension occurs with a frequency according to their respective rates ( Figure S10) and that the gene family size distribution followed a power-law when duplication rates are higher than loss rates and stretched-exponential in the opposite case ( Figure S11). We also checked by hand the validity of many simple scenarios to detect possible inconsistencies in the algorithm.…”
Section: Performance and Validationsupporting
confidence: 72%
See 1 more Smart Citation
“…Simulations with Zombi are fast: with a starting genome of 500 genes and a species tree of 2000 taxa (extinct + extant), it takes around 1 minute on a 3.4Ghz laptop to simulate all the genomes ( Figure S6). We validated that the distribution of waiting times between successive events was following an exponential distribution ( Figure S7 and S8), that the distribution of intergene sizes at equilibrium was following a flat Dirichlet distribution, as expected from Biller et al 2016 ( Figure S9), that the number of events and their extension occurs with a frequency according to their respective rates ( Figure S10) and that the gene family size distribution followed a power-law when duplication rates are higher than loss rates and stretched-exponential in the opposite case ( Figure S11). We also checked by hand the validity of many simple scenarios to detect possible inconsistencies in the algorithm.…”
Section: Performance and Validationsupporting
confidence: 72%
“…For example, it is possible to use a species tree input by the user, to generate species trees with variable extinction and speciation rates, or to control the number of living lineages at each unit of time ( Figure S5). At the genome level, Zombi can simulate genomes using branch-specific rates (Gu mode, allowing the user to simulate very specific scenarios such as one in which a certain lineage experiences a massive loss of genes), gene-family specific rates (Gm mode, which makes easier the process of using rates estimated from real datasets) and genomes accounting for intergenic regions (Gf mode) of variable length (drawn from a flat Dirichlet distribution (Biller et al 2016) . At the sequence level, finally, the user can fine-tune the substitution rates to make them branch specific.…”
Section: Advanced Featuresmentioning
confidence: 99%
“…First, the definition of weighted genomes [1, 3] opens combinatorial questions, one of which being the transformation of a genome into another in a minimum number of steps. In a previous paper [3] we solved the strict version of this problem, where genomes were forced to have the same total intergene sizes and only wDCJs were allowed.…”
Section: Discussionmentioning
confidence: 99%
“…In a previous publication [1], we have argued that intergenic sizes were a crucial parameter to infer genome rearrangement distances. Indeed, ignoring this information, as all published distance estimations were doing so far [2], leads to strong biases in all estimations and validation procedures.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation