2016
DOI: 10.1093/nar/gkw1119
|View full text |Cite
|
Sign up to set email alerts
|

OrthoDB v9.1: cataloging evolutionary and functional annotations for animal, fungal, plant, archaeal, bacterial and viral orthologs

Abstract: OrthoDB is a comprehensive catalog of orthologs, genes inherited by extant species from a single gene in their last common ancestor. In 2016 OrthoDB reached its 9th release, growing to over 22 million genes from over 5000 species, now adding plants, archaea and viruses. In this update we focused on usability of this fast-growing wealth of data: updating the user and programmatic interfaces to browse and query the data, and further enhancing the already extensive integration of available gene functional annotat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
365
0
2

Year Published

2017
2017
2023
2023

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 421 publications
(369 citation statements)
references
References 34 publications
2
365
0
2
Order By: Relevance
“…We used a set of 1,759 BUSCO genes inferred to be saccharomyceta-specific (“saccharomyceta” is an informal taxonomic rank used by many databases, including NCBI: https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Info&id=716545) and single-copy in at least 90% of 60 genomes in the OrthoDB Version 9 database (Zdobnov et al, 2017) to evaluate the completeness of the 332 genome assemblies. Briefly, for each BUSCO gene, its consensus orthologous protein sequence among the 60 reference genomes was used as query in a tBLASTn search against each genome to identify up to three putative genomic regions, and the gene structure of each putative genomic region was predicted by AUGUSTUS v 3.2.2 (Stanke and Waack, 2003).…”
Section: Methods Detailsmentioning
confidence: 99%
“…We used a set of 1,759 BUSCO genes inferred to be saccharomyceta-specific (“saccharomyceta” is an informal taxonomic rank used by many databases, including NCBI: https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Info&id=716545) and single-copy in at least 90% of 60 genomes in the OrthoDB Version 9 database (Zdobnov et al, 2017) to evaluate the completeness of the 332 genome assemblies. Briefly, for each BUSCO gene, its consensus orthologous protein sequence among the 60 reference genomes was used as query in a tBLASTn search against each genome to identify up to three putative genomic regions, and the gene structure of each putative genomic region was predicted by AUGUSTUS v 3.2.2 (Stanke and Waack, 2003).…”
Section: Methods Detailsmentioning
confidence: 99%
“…All contigs <200 bp were removed prior to final assembly submission. To test the quality of the assembly, the aves_odb9 dataset of single copy, orthologous, Avian specific genes from orthodb version 9 (Zdobnov et al., ) was selected to check their status (present, duplicated, fragment or missing) with busco version 3.0.2 (Waterhouse et al., ) in the Galgal4, Galgal5 and GRCg6a assemblies of the chicken genome and in our NumMel1 guinea fowl assembly. The assembly is publicly available in NCBI Assembly under the name of NumMel1.0 (accession GCA_002078875.2).…”
Section: Methodsmentioning
confidence: 99%
“…As many of the transcriptome libraries were assembled using pooled individuals and developmental stages, cd‐hit v.4.6.1 (Godzik and Li, ) was used to select the longest transcript with a 98% sequence identity threshold (‑c 0.98). Single‐copy nuclear orthologues were then identified using the orthoDB Diptera gene set ( diptera_odb9 , 2799 genes) (Zdobnov et al, ) with busco (v.3) (Waterhouse et al, ) and augustus (Stanke and Morgenstern, ). Orthologous genes annotated in all samples ( n = 16) were extracted and nucleotides aligned based on reading frame using geneous v.10.2.6 with the mafft FFT‐NS‐2 algorithm (Katoh and Standley, ).…”
Section: Methodsmentioning
confidence: 99%