Galen Andrew scite author profile

A Field Guide to Federated Optimization

Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection. The distributed learning process can be formulated as solving federated optimization problems, which emphasize communication efficiency, data heterogeneity, compatibility with privacy and system requirements, and other constraints that are not primary considerations in other problem settings. This paper provides recommendations and guidelines on formulating, designing, evaluating and analyzing federated optimization algorithms through concrete examples and practical implementation, with a focus on conducting effective simulations to infer real-world performance. The goal of this work is not to survey the current literature, but to inspire researchers and practitioners to design federated learning algorithms that can be used in various practical applications.


A hybrid Markov/semi-Markov conditional random field for sequence segmentation

Andrew 2006


Markov order-1 conditional random fields (CRFs) and semi-Markov CRFs are two popular models for sequence segmentation and labeling. Both models have advantages in terms of the type of features they most naturally represent. We propose a hybrid model that is capable of representing both types of features, and describe efficient algorithms for its training and inference. We demonstrate that our hybrid model achieves error reductions of 18% and 25% over a standard order-1 CRF and a semi-Markov CRF (resp.) on the task of Chinese word segmentation. We also propose the use of a powerful feature for the semi-Markov CRF: the log conditional odds that a given token sequence constitutes a chunk according to a generative model, which reduces error by an additional 13%. Our best system achieves 96.8% F-measure, the highest reported score on this test set.


A General Approach to Adding Differential Privacy to Iterative Training Procedures

McMahan, Andrew, Erlingsson et al. 2018

Preprint


In this work we address the practical challenges of training machine learning models on privacy-sensitive datasets by introducing a modular approach that minimizes changes to training algorithms, provides a variety of configuration strategies for the privacy mechanism, and then isolates and simplifies the critical logic that computes the final privacy guarantees. A key challenge is that training algorithms often require estimating many different quantities (vectors) from the same set of examples: for example, gradients of different layers in a deep learning architecture, as well as metrics and batch normalization parameters. Each of these may have different properties like dimensionality, magnitude, and tolerance to noise. By extending previous work on the Moments Accountant for the subsampled Gaussian mechanism, we can provide privacy for such heterogeneous sets of vectors, while also structuring the approach to minimize software engineering challenges.


Boosting as a Metaphor for Algorithm Design

Leyton‐Brown, Nudelman, Andrew et al. 2003


scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations: citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA


Part of the Research Solutions Family.

Galen Andrew

Scalable training of L1-regularized log-linear models

A Field Guide to Federated Optimization

A hybrid Markov/semi-Markov conditional random field for sequence segmentation

A General Approach to Adding Differential Privacy to Iterative Training Procedures

Boosting as a Metaphor for Algorithm Design
