2020
DOI: 10.1038/s41598-020-73937-w
|View full text |Cite
|
Sign up to set email alerts
|

Evolutionary analysis of the Moringa oleifera genome reveals a recent burst of plastid to nucleus gene duplications

Abstract: It is necessary to identify suitable alternative crops to ensure the nutritional demands of a growing global population. The genome of Moringa oleifera, a fast-growing drought-tolerant orphan crop with highly valuable agronomical, nutritional and pharmaceutical properties, has recently been reported. We model here gene family evolution in Moringa as compared with ten other flowering plant species. Despite the reduced number of genes in the compact Moringa genome, 101 gene families, grouping 957 genes, were fou… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 84 publications
0
8
0
Order By: Relevance
“…In terms of structural gene annotation, a combination of three strategies (ab initio prediction, protein homology, and RNA‐seq‐based evidence) and manual curation resulted in 22,714 protein‐coding genes, 93.1% of which (21,143 genes) are supported by transcriptome data together with isoforms from Iso‐seq in this new moringa assembly (AOCC v2), a number substantially higher than that of the previous one (AOCC v1), which represented approximately 77.9% of its estimated genome size and was annotated with a total of 18,451 genes (Chang et al., 2019; Ojeda‐Lopez et al., 2020). On average, predicted protein‐coding genes are 3,340 bp long and contain 5.5 exons, values that are slightly higher than that observed in the previous assembly of the moringa genome (Chang et al., 2019; Ojeda‐Lopez et al., 2020) (Table 2). Five libraries of RNA‐seq data from different moringa tissues were downloaded from the NCBI Short Read Sequence Archive (Supplemental Table S1) and aligned to the new moringa genome assembly (Pasha et al., 2020).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In terms of structural gene annotation, a combination of three strategies (ab initio prediction, protein homology, and RNA‐seq‐based evidence) and manual curation resulted in 22,714 protein‐coding genes, 93.1% of which (21,143 genes) are supported by transcriptome data together with isoforms from Iso‐seq in this new moringa assembly (AOCC v2), a number substantially higher than that of the previous one (AOCC v1), which represented approximately 77.9% of its estimated genome size and was annotated with a total of 18,451 genes (Chang et al., 2019; Ojeda‐Lopez et al., 2020). On average, predicted protein‐coding genes are 3,340 bp long and contain 5.5 exons, values that are slightly higher than that observed in the previous assembly of the moringa genome (Chang et al., 2019; Ojeda‐Lopez et al., 2020) (Table 2). Five libraries of RNA‐seq data from different moringa tissues were downloaded from the NCBI Short Read Sequence Archive (Supplemental Table S1) and aligned to the new moringa genome assembly (Pasha et al., 2020).…”
Section: Resultsmentioning
confidence: 99%
“…We found at least 11 glucosinolate‐related orthogroups including tandemly arranged genes in moringa. Two notable examples are formed by orthogroup HOG0000577, annotated as myrosinases, and orthogroup HOG0009822 annotated as encoding for GDSL‐type esterases/lipases, both of which were found to be specifically expanded in moringa and in agreement with what had been previously observed (Ojeda‐Lopez et al., 2020). Glucosinolates are relatively stable in nature and have no biological activity.…”
Section: Discussionmentioning
confidence: 99%
“…ICL penalizes clusters that overlap excessively and tends to prefer clusters with well-delineated separation (Biernacki et al , 2000). Our approach is adapted from previously described methods (Lang et al , 2018; Li et al , 2018; Shingate et al , 2020; Mabry et al , 2020; Ojeda-López et al , 2020).…”
Section: Methodsmentioning
confidence: 99%
“…Here, we provide a bioinformatic method to determine and investigate the source of NUMPTs in genome assemblies. Accurate identification of NUMTs and NUPTs is important for genome assemblies, and for understanding their role during evolution [76,77].…”
Section: Plos Onementioning
confidence: 99%