2020
DOI: 10.1186/s12915-020-0756-z

Improving the usability and comprehensiveness of microbial databases

Abstract: Metagenomics studies leverage genomic reference databases to generate discoveries in basic science and translational research. However, current microbial studies use disparate reference databases that lack consistent standards of specimen inclusion, data preparation, taxon labelling and accessibility, hindering their quality and comprehensiveness, and calling for the establishment of recommendations for reference genome database assembly. Here, we analyze existing fungal and bacterial databases and discuss gui…

Cited by 18 publications (14 citation statements)
References 9 publications
“…The latter groups are assigned a unique digital object identifier (DOI) in the interim. Loeffler et al [80] discuss the importance of reference genomes to metagenomics studies but pertinently warn of the poor integration of new sequences into the existing database as well as the cooperative interactions between different databases with the resulting lack of a credibly comprehensive database despite progress in sequence curation. The direct implication is the doubtful identification of any fungi that are analysed using only one reference database [81].…”
Section: Advances In Molecular Characterisation Of Fungi (mentioning)
confidence: 99%
“…These reads of low mapping quality are usually discarded. Furthermore, some species have no reliable reference, which is common in microbes [12].…”
Section: Introduction (mentioning)
confidence: 99%
“…This is an important issue - because as new genome data is generated, training data sets, such as the commonly used NCBI Reference Sequence Database (RefSeq), will change over time. Consistency between databases and completeness of genomes is already an issue plaguing these databases [15]. Loeffler et al highlight the cost of computational power and storage requirements of maintaining a master reference sequence database, with a proposed solution of a “continuous assembly approach” (supported by the institution’s infrastructure and the scientific community supplying the data sources) [15].…”
Section: Introduction (mentioning)
confidence: 99%
“…Consistency between databases and completeness of genomes is already an issue plaguing these databases [15]. Loeffler et al highlight the cost of computational power and storage requirements of maintaining a master reference sequence database, with a proposed solution of a “continuous assembly approach” (supported by the institution’s infrastructure and the scientific community supplying the data sources) [15]. From a classifier perspective, Nasko et al recently demonstrated that more reads are classified (as opposed to be assigned to an unclassified/unknown class) by the Kraken classifier with newer database versions [16].…”
Section: Introduction (mentioning)
confidence: 99%