2019
DOI: 10.1007/s10844-019-00545-0
|View full text |Cite
|
Sign up to set email alerts
|

CBSSD: community-based semantic subgroup discovery

Abstract: Modern data mining algorithms frequently need to address the task of learning from heterogeneous data, including various sources of background knowledge. A data mining task where ontologies are used as background knowledge in data analysis is referred to as semantic data mining. A specific semantic data mining task is semantic subgroup discovery: a rule learning approach enabling ontology terms to be used in subgroup descriptions learned from class labeled data. This paper presents Community-Based Semantic Sub… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
7
1
1

Relationship

6
3

Authors

Journals

citations
Cited by 14 publications
(14 citation statements)
references
References 63 publications
0
14
0
Order By: Relevance
“…The idea of applying enrichment methods to identify relevant subnetworks from a biological standpoint is a well-known approach performed in literature using: a) ontology-based enrichment analysis [ 62 ], b) de novo enrichment analysis [ 63 ], c) community-based algorithms followed by semantic rule induction [ 64 ] to link biological explanations to each discovered subgroups. Similarly, in our work we used community detection followed by enrichment analysis to dissect our biomolecular network in subgroups described by enriched pathways coherent and correlated inside each community.…”
Section: Resultsmentioning
confidence: 99%
“…The idea of applying enrichment methods to identify relevant subnetworks from a biological standpoint is a well-known approach performed in literature using: a) ontology-based enrichment analysis [ 62 ], b) de novo enrichment analysis [ 63 ], c) community-based algorithms followed by semantic rule induction [ 64 ] to link biological explanations to each discovered subgroups. Similarly, in our work we used community detection followed by enrichment analysis to dissect our biomolecular network in subgroups described by enriched pathways coherent and correlated inside each community.…”
Section: Resultsmentioning
confidence: 99%
“…Conceptually, community-based semantic subgroup discovery (CBSSD) consists of two main steps, i.e., community detection followed by a one-versus-all enrichment procedure, which here, was additionally corrected for multiple hypothesis testing. We refer the reader to Škrlj et al (2019) [ 24 ] for a more detailed overview of the enrichment process and for additional theoretical insights. Here, we used the method ‘as-is’, with the default hyperparameter settings including Bonferroni’s multiple test correction and the significance threshold of 0.05 (Fisher’s exact test).…”
Section: Proposed Methodologymentioning
confidence: 99%
“…Applied network-based methodologies have shown promising results in many branches of plant biology, including studies of immunity [ 21 ] and regulatory pathways [ 22 , 23 ]. When considering, for example, community enrichment [ 24 ], drug design [ 25 ] or the structural analysis of protein binding sites [ 26 , 27 ], network-based approaches have also been applied to organisms other than plants.…”
Section: Introductionmentioning
confidence: 99%
“…The use of background knowledge for improving the understandability of machine learning systems has also been referred to as semantic data mining [11]. For example, in relational domains, the CBSSD methodology [12] focuses on data mining tasks that use ontologies as background knowledge when explaining emergent structures in complex networks. Similarly, the NetSDM approach [13] explored how redundant ontologies are for the purposes of inductive rule learning.…”
Section: Obtain Explanationsmentioning
confidence: 99%