2004
DOI: 10.1007/bf02637154
|View full text |Cite
|
Sign up to set email alerts
|

Collective mining of Bayesian networks from distributed heterogeneous data

Abstract: Abstract. We present a collective approach to learning a Bayesian network from distributed heterogenous data. In this approach, we first learn a local Bayesian network at each site using the local data. Then each site identifies the observations that are most likely to be evidence of coupling between local and non-local variables and transmits a subset of these observations to a central site. Another Bayesian network is learnt at the central site using the data transmitted from the local site. The local and ce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2009
2009
2015
2015

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 17 publications
(3 reference statements)
0
4
0
Order By: Relevance
“…The key component of BBM is a BN-induction algorithm; such algorithms scale well with the number of samples, and have been shown to scale to millions of samples 17, 22 . The BNs in Figure 3 include a model with a small number of variables (Figure 3 (a)), one with medium complexity (Figure 3 (b) and Figure 3 (c)), and a model with high complexity (Figure 3 (d)).…”
Section: Resultsmentioning
confidence: 99%
“…The key component of BBM is a BN-induction algorithm; such algorithms scale well with the number of samples, and have been shown to scale to millions of samples 17, 22 . The BNs in Figure 3 include a model with a small number of variables (Figure 3 (a)), one with medium complexity (Figure 3 (b) and Figure 3 (c)), and a model with high complexity (Figure 3 (d)).…”
Section: Resultsmentioning
confidence: 99%
“…where the participating parties get exact solution based on combined database [7]. For several computational experiments, the different database are considered to have single solution based on the data collection from different parties using different techniques such as standard algorithm for decision tree [8] in peer-to-peer systems, identification of TOP-l inner products elements in P2P network [9], collective mining of Bayesian networks [10], mining criminal networks [11], incentive compatible privacy preserving of data in distributed classification [12], etc. The techniques of above paper indicate the importance of the role of participating parties in distributed environment where each party never wants to release their private data without protection.…”
Section: Related Workmentioning
confidence: 99%
“…In this case, every distributed node has the same feature but different observations. A collective approach to implement a distributed Bayesian network has been proposed by Chen et al [3]. But this approach is implemented in a client-server framework for heterogeneous databases.…”
Section: Related Workmentioning
confidence: 99%