Zelda Mariet scite author profile

Zelda Mariet

5Publications

53Citation Statements Received

61Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning

Nado¹,

Band²,

Collier³

et al. 2021

Preprint

View full text Add to dashboard Cite

High-quality estimates of uncertainty and robustness are crucial for numerous real-world applications, especially for deep learning which underlies many deployed ML systems. The ability to compare techniques for improving these estimates is therefore very important for research and practice alike. Yet, competitive comparisons of methods are often lacking due to a range of reasons, including: compute availability for extensive tuning, incorporation of sufficiently many baselines, and concrete documentation for reproducibility. In this paper we introduce Uncertainty Baselines: high-quality implementations of standard and state-ofthe-art deep learning methods on a variety of tasks. As of this writing, the collection spans 19 methods across 9 tasks, each with at least 5 metrics. Each baseline is a self-contained experiment pipeline with easily reusable and extendable components. Our goal is to provide immediate starting points for experimentation with new methods or applications. Additionally we provide model checkpoints, experiment outputs as Python notebooks, and leaderboards for comparing results. https://github.com/google/uncertainty-baselines

show abstract

Diversity Networks: Neural Network Compression Using Determinantal Point Processes

Mariet¹,

Sra²

2015

Preprint

View full text Add to dashboard Cite

Pre-trained Gaussian processes for Bayesian optimization

Wang¹,

Dahl²,

Swersky³

et al. 2021

Preprint

View full text Add to dashboard Cite

Sparse MoEs meet Efficient Ensembles

Allingham¹,

Wenzel²,

Mariet³

et al. 2021

Preprint

View full text Add to dashboard Cite

Machine learning models based on the aggregated outputs of submodels, either at the activation or prediction levels, lead to strong performance. We study the interplay of two popular classes of such models: ensembles of neural networks and sparse mixture of experts (sparse MoEs). First, we show that the two approaches have complementary features whose combination is beneficial. Then, we present partitioned batch ensembles, an efficient ensemble of sparse MoEs that takes the best of both classes of models. Extensive experiments on fine-tuned vision Transformers demonstrate the accuracy, log-likelihood, few-shot learning, robustness, and uncertainty improvements of our approach over several challenging baselines. Partitioned batch ensembles not only scale to models with up to 2.7B parameters, but also provide larger performance gains for larger models. * Work done as a Google Brain intern. † Work done while at Google Brain.

show abstract

DPPNet: Approximating Determinantal Point Processes with Deep Networks

Mariet¹,

Ovadia²,

Snoek³

2019

Preprint

View full text Add to dashboard Cite

Determinantal Point Processes (DPPs) provide an elegant and versatile way to sample sets of items that balance the point-wise quality with the set-wise diversity of selected items. For this reason, they have gained prominence in many machine learning applications that rely on subset selection. However, sampling from a DPP over a ground set of size N is a costly operation, requiring in general an O(N 3 ) preprocessing cost and an O(N k 3 ) sampling cost for subsets of size k. We approach this problem by introducing DPPNETs: generative deep models that produce DPP-like samples for arbitrary ground sets. We develop an inhibitive attention mechanism based on transformer networks that captures a notion of dissimilarity between feature vectors. We show theoretically that such an approximation is sensible as it maintains the guarantees of inhibition or dissimilarity that makes DPPs so powerful and unique. Empirically, we demonstrate that samples from our model receive high likelihood under the more expensive DPP alternative. * Work done while at Google Brain. 2 Here, we use diversity to mean useful coverage across dissimilar examples in a meaningful feature space, rather than other definitions diversity that may appear in ML fairness literature. 3 We are adopting here the L-Ensemble construction (Borodin & Rains, 2005) of a DPP.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zelda Mariet

Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning

Diversity Networks: Neural Network Compression Using Determinantal Point Processes

Pre-trained Gaussian processes for Bayesian optimization

Sparse MoEs meet Efficient Ensembles

DPPNet: Approximating Determinantal Point Processes with Deep Networks

Contact Info

Product

Resources

About