2018
DOI: 10.1101/270009
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Factors influencing gene family size variation among related species in a plant family

Abstract: the research plans; P.W. performed most of the analysis; B.M., N.P. and F.M. provided technical assistance to P.W.; P.W., M.L.-S., and S.-H.S. wrote the article. AbstractGene duplication and loss contribute to gene content differences as well as phenotypic divergence across species. However, the extent to which gene content varies among closely related plant species and the factors responsible for such variation remain unclear. Here, we used the Solanaceae family as a model to investigate differences in gene f… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
16
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 12 publications
(17 citation statements)
references
References 91 publications
(97 reference statements)
1
16
0
Order By: Relevance
“…Custom HMM models for major NLR exons (FISNA-NACHT, and PRY-SPRY/ B30.2) were generated and utilized during this process (see Additional file 29 for hmm models). The majority of identified FISNA-NACHT exons contained frameshifts or a large insertion, indicating either pseudogenization, acquisition of new introns, problems with the assembly, or a combination of the three [228]. For subsequent phylogenetic analyses, only the 61 clearly intact NLRs were used.…”
Section: Immune Systemmentioning
confidence: 99%
“…Custom HMM models for major NLR exons (FISNA-NACHT, and PRY-SPRY/ B30.2) were generated and utilized during this process (see Additional file 29 for hmm models). The majority of identified FISNA-NACHT exons contained frameshifts or a large insertion, indicating either pseudogenization, acquisition of new introns, problems with the assembly, or a combination of the three [228]. For subsequent phylogenetic analyses, only the 61 clearly intact NLRs were used.…”
Section: Immune Systemmentioning
confidence: 99%
“…The most recent duplication points for genes appearing to originate from multiple duplication nodes were defined by the highest-numbered node they belonged to (Figure S7). Pseudogenes in tomato were determined as in Wang et al (2018) where genomic regions with significant similarity to proteincoding genes but with premature stops/frameshifts and/or were truncated were treated as pseudogenes (64). Detailed methods and parsing scripts for different features can be found in: https://github.com/ShiuLab/SM-gene_prediction_Slycopersicum.…”
Section: Evolutionary Featuresmentioning
confidence: 99%
“…Aside from gene annotations, we defined or obtained additional genome features including pseudogenes, transposable elements (TE), simple sequence repeat (SSR; stretch of DNA, 2 ~ 64bp, repeated >1 time and the repetitions are immediately adjacent to each other), and tandemly duplicated genes. Pseudogenes were defined as genomic regions with significant similarity to protein-coding genes had premature stops/frameshifts and/or were truncated as described in (27). Transposable element (TE) annotation was based on SGN ITAG2.4 release.…”
Section: Genome and Functional Annotations Definitions Of Genome Feamentioning
confidence: 99%
“…Minscore = 50, Maxperiod = 500 (28). Tandemly duplicated genes were identified using MCScanX-transposed (29), as described previously (27), where paralogs are directly adjacent to each other, or separated by…”
Section: Genome and Functional Annotations Definitions Of Genome Feamentioning
confidence: 99%