Nina Taft scite author profile

Ridge regression is an algorithm that takes as input a large number of data points and finds the best-fit linear curve through these points. The algorithm is a building block for many machine-learning operations. We present a system for privacy-preserving ridge regression. The system outputs the best-fit curve in the clear, but exposes no other information about the input data. Our approach combines both homomorphic encryption and Yao garbled circuits, where each is used in a different part of the algorithm to obtain the best performance. We implement the complete system and experiment with it on real data-sets, and show that it significantly outperforms pure implementations based only on homomorphic encryption or Yao circuits.

show abstract

Structural analysis of network traffic flows

Lakhina

et al. 2004

View full text Add to dashboard Cite

Network traffic arises from the superposition of Origin-Destination (OD) flows. Hence, a thorough understanding of OD flows is essential for modeling network traffic, and for addressing a wide variety of problems including traffic engineering, traffic matrix estimation, capacity planning, forecasting and anomaly detection. However, to date, OD flows have not been closely studied, and there is very little known about their properties.We present the first analysis of complete sets of OD flow timeseries, taken from two different backbone networks (Abilene and Sprint-Europe). Using Principal Component Analysis (PCA), we find that the set of OD flows has small intrinsic dimension. In fact, even in a network with over a hundred OD flows, these flows can be accurately modeled in time using a small number (10 or less) of independent components or dimensions.We also show how to use PCA to systematically decompose the structure of OD flow timeseries into three main constituents: common periodic trends, short-lived bursts, and noise. We provide insight into how the various constitutents contribute to the overall structure of OD flows and explore the extent to which this decomposition varies over time.

show abstract

Privacy-preserving matrix factorization

et al. 2013

View full text Add to dashboard Cite

Combining filtering and statistical methods for anomaly detection

Soule¹,

Salamatian²,

Taft³

2005

222

151

View full text Add to dashboard Cite

Learning in a Large Function Space: Privacy-Preserving Mechanisms for SVM Learning

Rubinstein

Bartlett

Huang

et al. 2012

JPC

154

143

View full text Add to dashboard Cite

Abstract.The ubiquitous need for analyzing privacy-sensitive informationincluding health records, personal communications, product ratings, and social network data-is driving significant interest in privacy-preserving data analysis across several research communities. This paper explores the release of Support Vector Machine (SVM) classifiers while preserving the privacy of training data. The SVM is a popular machine learning method that maps data to a highdimensional feature space before learning a linear decision boundary. We present efficient mechanisms for finite-dimensional feature mappings and for (potentially infinite-dimensional) mappings with translation-invariant kernels. In the latter case, our mechanism borrows a technique from large-scale learning to learn in a finite-dimensional feature space whose inner-product uniformly approximates the desired feature space inner-product (the desired kernel) with high probability. Differential privacy is established using algorithmic stability, a property used in learning theory to bound generalization error. Utility-when the private classifier is pointwise close to the non-private classifier with high probability-is proven using smoothness of regularized empirical risk minimization with respect to small perturbations to the feature mapping. Finally we conclude with lower bounds on the differential privacy of any mechanism approximating the SVM.

show abstract

GraphSC: Parallel Secure Computation Made Easy

et al. 2015

View full text Add to dashboard Cite

Structural Analysis of Network Traffic Flows

Lakhina¹,

Papagiannaki²,

Crovella³

et al. 2003

107

View full text Add to dashboard Cite

show abstract

Long-term forecasting of Internet backbone traffic: observations and initial models

et al. 2003

View full text Add to dashboard Cite

Abstract-We introduce a methodology to predict when and where link additions/upgrades have to take place in an IP backbone network. Using SNMP statistics, collected continuously since 1999, we compute aggregate demand between any two adjacent PoPs and look at its evolution at time scales larger than one hour. We show that IP backbone traffic exhibits visible long term trends, strong periodicities, and variability at multiple time scales.Our methodology relies on the wavelet multiresolution analysis and linear time series models. Using wavelet multiresolution analysis, we smooth the collected measurements until we identify the overall long-term trend. The fluctuations around the obtained trend are further analyzed at multiple time scales. We show that the largest amount of variability in the original signal is due to its fluctuations at the 12 hour time scale.We model inter-PoP aggregate demand as a multiple linear regression model, consisting of the two identified components. We show that this model accounts for 98% of the total energy in the original signal, while explaining 90% of its variance. Weekly approximations of those components can be accurately modeled with low-order AutoRegressive Integrated Moving Average (ARIMA) models. We show that forecasting the long term trend and the fluctuations of the traffic at the 12 hour time scale yields accurate estimates for at least six months in the future.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Nina Taft

Privacy-Preserving Ridge Regression on Hundreds of Millions of Records

Structural analysis of network traffic flows

Privacy-preserving matrix factorization

Combining filtering and statistical methods for anomaly detection

Learning in a Large Function Space: Privacy-Preserving Mechanisms for SVM Learning

GraphSC: Parallel Secure Computation Made Easy

Structural Analysis of Network Traffic Flows

Long-term forecasting of Internet backbone traffic: observations and initial models

Contact Info

Product

Resources

About