2019
DOI: 10.1101/2019.12.17.880294
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Foster thy young: Enhanced prediction of orphan genes in assembled genomes

Abstract: The evolutionary rapid emergence of new genes gives rise to "orphan genes" that share no sequence homology to genes in closely related genomes. These genes provide organisms with a reservoir of genetic elements to quickly respond to changing selection pressures. Gene annotation pipelines that combine ab initio machine-learning with sequence homology-based searches are efficient in identifying basal genes with a long evolutionary history. However, their ability to identify orphan genes and other young genes has… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
3

Relationship

5
2

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 144 publications
(277 reference statements)
0
13
0
Order By: Relevance
“…The 26 NAM genomes were annotated using a hybrid evidence and ab initio based gene prediction pipeline (81). Evidence-based predictions were directly inferred from the assembled transcripts, which were generated using five different genome-guided transcript assembly RNA-seq reads from each library were mapped to their respective NAM genomes using STAR (v2.5.3a) (88) with an iterative 2-pass mapping approach in which splice junctions generated from the first round were used to refine alignments in the subsequent round.…”
Section: Gene Model Annotationmentioning
confidence: 99%
“…The 26 NAM genomes were annotated using a hybrid evidence and ab initio based gene prediction pipeline (81). Evidence-based predictions were directly inferred from the assembled transcripts, which were generated using five different genome-guided transcript assembly RNA-seq reads from each library were mapped to their respective NAM genomes using STAR (v2.5.3a) (88) with an iterative 2-pass mapping approach in which splice junctions generated from the first round were used to refine alignments in the subsequent round.…”
Section: Gene Model Annotationmentioning
confidence: 99%
“…e ., species-specific genes). As ab initio predictions of young genes can be unreliable (Seetharam et al, 2019), these were excluded. Finally, redundant copies of genes between direct evidence and ab initio predictions were identified and removed using Mikado compare (2.0rc2; (Venturini et al, 2018)) and merging was performed locus by locus, incorporating additional isoforms when necessary.…”
Section: Methodsmentioning
confidence: 99%
“…We compared the accuracy of TSEBRA with EVM, one of the most cited combiner tools to date. EVM was previously used to combine BRAKER with other predictions [39], to combine BRAKER2 predictions with RNA-seq evidence [40,41] or other RNA-seq based predictions [42], and even for combining multiple BRAKER predictions [43]. Still, it is not the most suitable task for EVM to create a BRAKER-only combination.…”
Section: Comparison With Evidencemodelermentioning
confidence: 99%