2008
DOI: 10.1002/sam.10009
|View full text |Cite
|
Sign up to set email alerts
|

A Scalable Local Algorithm for Distributed Multivariate Regression

Abstract: This paper offers a local distributed algorithm for multivariate regression in large peer-to-peer environments. The algorithm can be used for distributed inferencing, data compaction, data modeling and classification tasks in many emerging peer-to-peer applications for bioinformatics, astronomy, social networking, sensor networks and web mining. Computing a global regression model from data available at the different peer-nodes using a traditional centralized algorithm for regression can be very costly and imp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
4
2
2

Relationship

5
3

Authors

Journals

citations
Cited by 16 publications
(18 citation statements)
references
References 37 publications
(33 reference statements)
0
18
0
Order By: Relevance
“…This can happen, for example, when the each node has a vector in a different convex region and the global average is in another different region. However, as shown in this paper and also by several authors [27] [6] there are several problem instances for which the resource consumption becomes independent of the size of the network. Interested readers are referred to [5] for a detailed discussion on communication complexity and locality of such algorithms.…”
Section: Algorithm 62 Building Modelsmentioning
confidence: 62%
See 2 more Smart Citations
“…This can happen, for example, when the each node has a vector in a different convex region and the global average is in another different region. However, as shown in this paper and also by several authors [27] [6] there are several problem instances for which the resource consumption becomes independent of the size of the network. Interested readers are referred to [5] for a detailed discussion on communication complexity and locality of such algorithms.…”
Section: Algorithm 62 Building Modelsmentioning
confidence: 62%
“…The distributed PCA implementation makes use of the Distributed Data Mining Toolkit (DDMT) 6 -a distributed data mining development environment from DIADIC research lab at UMBC. DDMT uses topological information which can be generate by BRITE 7 , a universal topology generator from Boston University.…”
Section: Results Of Distributed Pca Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…Though there is much work on distributed computing, there seems to be little prior work on a scalable solution of mixed regression problems with longitudinal data. Distributed regression algorithms are considered in Guestrin et al [2004], Bhaduri and Kargupta [2008], and Bazerque et al [2010]. These algorithms are motivated by sensor networks and the need to distribute the estimation computations over the network, rather than scalability needs.…”
Section: Previous Workmentioning
confidence: 99%
“…Recently, Bhaduri et al [1] have proposed an algorithm for doing regression in large P2P networks which checks the squared error between the predicted and the target variables based on a generic monitoring algorithm proposed by Wolff et al in [16]. If the error exceeds a predefined threshold (ǫ), the nodes raise an alert and the regression model is rebuilt.…”
Section: Related Workmentioning
confidence: 99%