2019
DOI: 10.7717/peerj.6303
|View full text |Cite
|
Sign up to set email alerts
|

GenoDup Pipeline: a tool to detect genome duplication using the dS-based method

Abstract: Understanding whole genome duplication (WGD), or polyploidy, is fundamental to investigating the origin and diversification of organisms in evolutionary biology. The wealth of genomic data generated by next generation sequencing (NGS) has resulted in an urgent need for handy and accurate tools to detect WGD. Here, I present a useful and user-friendly pipeline called GenoDup for inferring WGD using the dS-based method. I have successfully applied GenoDup to identify WGD in empirical data from both plants and an… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 31 publications
0
5
0
Order By: Relevance
“…Ancient WGD is usually inferred using the dS-based method, but artificial signals in dS distributions have been reported in previous studies, because of dS saturation (dS value >1) or using poorly annotated genomes or allelic variations or low retention rates (Rabier et al., 2014, Tiley et al., 2016, Vanneste et al., 2012, Mao, 2019). There is an extra peak in the dS distribution of anchor gene pairs in A .…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Ancient WGD is usually inferred using the dS-based method, but artificial signals in dS distributions have been reported in previous studies, because of dS saturation (dS value >1) or using poorly annotated genomes or allelic variations or low retention rates (Rabier et al., 2014, Tiley et al., 2016, Vanneste et al., 2012, Mao, 2019). There is an extra peak in the dS distribution of anchor gene pairs in A .…”
Section: Discussionmentioning
confidence: 99%
“…In the past 10 years, next-generation sequencing has generated a wealth of genomic data at vastly decreased cost and reduced efforts (Goodwin et al., 2016, Hardwick et al., 2017). Three main methods were developed to identify WGD, based on (1) analysis of the rate of synonymous substitutions per synonymous site (dS) of duplicated genes within a genome (dS-based method) (Blanc et al., 2003, Lynch and Conery, 2000, Vanneste et al., 2014, Mao, 2019), (2) phylogenetic analysis of gene families among multiple genomes (phylogenomic analysis) (Blomme et al., 2006, Jiao et al., 2011), and (3) synteny block identification compared with sister lineages without WGD (synteny analysis) (Bowers et al., 2003, Dehal and Boore, 2005, Zhang et al., 2017). The dS-based method and phylogenomic analysis only require gene family information, without genome assembly.…”
Section: Introductionmentioning
confidence: 99%
“…From the synteny table, gene pairs between C. neglecta and subgenomes of C. sativa as well as A. thaliana were identified and used to calculate the rates of synonymous substitutions per synonymous site ( K s) using GenoDup pipeline (Mao, 2019 ). This pipeline utilized protein and nucleotide sequences of a gene pair to calculate ( K s).…”
Section: Methodsmentioning
confidence: 99%
“…For every collinear gene pair in the genome, we calculated the Ks value for each pair. In addition, the Ks value was calculated using the paralogous gene pair generated by orthologous gene clustering using the GenoDup‐Pipeline with “‐n 5” parameter (Mao, 2019). A plant average Ks /year rate of 6.1 × 10 −9 was used to predict the divergence time of the WGD event in diploid plants (Fawcett et al, 2009).…”
Section: Methodsmentioning
confidence: 99%