2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2019
DOI: 10.1109/bibm47256.2019.8983041
|View full text |Cite
|
Sign up to set email alerts
|

Supporting supervised learning in fungal Biosynthetic Gene Cluster discovery: new benchmark datasets

Abstract: Fungal Biosynthetic Gene Clusters (BGCs) of secondary metabolites are clusters of genes capable of producing natural products, compounds that play an important role in the production of a wide variety of bioactive compounds, including antibiotics and pharmaceuticals. Identifying BGCs can lead to the discovery of novel natural products to benefit human health. Previous work has been focused on developing automatic tools to support BGC discovery in plants, fungi, and bacteria. Datadriven methods, as well as prob… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(12 citation statements)
references
References 25 publications
(60 reference statements)
0
12
0
Order By: Relevance
“…DeepBGC achieved a 0.923 area under the curve when predicting BGC positions in a set of 65 experimentally validated BGCs from six bacterial genomes, outperforming previous studies ( 15 ). When handling fungal BGC data, DeepBGC in its original version yielded performance no higher than 0.2 F -m ( 17 ), and when trained on fungal data underperformed previous methods such as fungiSMASH ( 11 ), as we show in the ‘Results’ section. This could indicate that BGC discovery methods developed for bacteria may not be suitable for fungi due to the high diversity of fungal BGCs that are found to vary even among closely related species ( 3 ).…”
Section: Introductionmentioning
confidence: 76%
See 4 more Smart Citations
“…DeepBGC achieved a 0.923 area under the curve when predicting BGC positions in a set of 65 experimentally validated BGCs from six bacterial genomes, outperforming previous studies ( 15 ). When handling fungal BGC data, DeepBGC in its original version yielded performance no higher than 0.2 F -m ( 17 ), and when trained on fungal data underperformed previous methods such as fungiSMASH ( 11 ), as we show in the ‘Results’ section. This could indicate that BGC discovery methods developed for bacteria may not be suitable for fungi due to the high diversity of fungal BGCs that are found to vary even among closely related species ( 3 ).…”
Section: Introductionmentioning
confidence: 76%
“…TOUCAN classification models were developed with comprehensive and exhaustive fungal BGC datasets presented in ( 17 ) that are publicly available to support benchmarking of BGC discovery methods. The six fungal BGC training datasets are composed of different distributions of positive instances obtained from the MIBiG (Minimum Information about a Biosynthetic Gene cluster) ( 2 ) repository and synthetic negative instances generated from OrthoDB ( 18 ) orthologues.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations