Wenshuo Guo scite author profile

Wenshuo Guo

5Publications

23Citation Statements Received

71Citation Statements Given

How they've been cited

How they cite others

134

Affiliations

Nankai University, Hong Kong University of Science and Technology, University of California, Berkeley

Publications

Order By: Most citations

Robust Optimization for Fairness with Noisy Protected Groups

Wang¹,

Guo²,

Narasimhan³

et al. 2020

Preprint

View full text Add to dashboard Cite

Many existing fairness criteria for machine learning involve equalizing or achieving some metric across protected groups such as race or gender groups. However, practitioners trying to audit or enforce such group-based criteria can easily face the problem of noisy or biased protected group information. We study this important practical problem in two ways. First, we study the consequences of naïvely only relying on noisy protected groups: we provide an upper bound on the fairness violations on the true groups G when the fairness criteria are satisfied on noisy groups Ĝ. Second, we introduce two new approaches using robust optimization that, unlike the naïve approach of only relying on Ĝ, are guaranteed to satisfy fairness criteria on the true protected groups G while minimizing a training objective. We provide theoretical guarantees that one such approach converges to an optimal feasible solution. Using two case studies, we empirically show that the robust approaches achieve better true group fairness guarantees than the naïve approach.

show abstract

The Stereotyping Problem in Collaboratively Filtered Recommender Systems

Guo

Krauth

Jordan

et al. 2021

View full text Add to dashboard Cite

Multi-Source Causal Inference Using Control Variates

Guo

Wang²,

Ding³

et al. 2021

Preprint

View full text Add to dashboard Cite

While many areas of machine learning have benefited from the increasing availability of large and varied datasets, the benefit to causal inference has been limited given the strong assumptions needed to ensure identifiability of causal effects; these are often not satisfied in real-world datasets. For example, many large observational datasets (e.g., case-control studies in epidemiology, click-through data in recommender systems) suffer from selection bias on the outcome, which makes the average treatment effect (ATE) unidentifiable. We propose a general algorithm to estimate causal effects from multiple data sources, where the ATE may be identifiable only in some datasets but not others. The key idea is to construct control variates using the datasets in which the ATE is not identifiable. We show theoretically that this reduces the variance of the ATE estimate. We apply this framework to inference from observational data under an outcome selection bias, assuming access to an auxiliary small dataset from which we can obtain a consistent estimate of the ATE. We construct a control variate by taking the difference of the odds ratio estimates from the two datasets. Across simulations and two case studies with real data, we show that this control variate can significantly reduce the variance of the ATE estimate.

show abstract

Optimization of financial network stability by genetic algorithm

Wang

Guo

Szeto

2017

View full text Add to dashboard Cite

Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits

Guo¹,

Agrawal²,

Grover³

et al. 2021

Preprint

View full text Add to dashboard Cite

We introduce the "inverse bandit" problem of estimating the rewards of a multi-armed bandit instance from observing the learning process of a low-regret demonstrator. Existing approaches to the related problem of inverse reinforcement learning assume the execution of an optimal policy, and thereby suffer from an identifiability issue. In contrast, our paradigm leverages the demonstrator's behavior en route to optimality, and in particular, the exploration phase, to obtain consistent reward estimates. We develop simple and efficient reward estimation procedures for demonstrations within a class of upper-confidence-based algorithms, showing that reward estimation gets progressively easier as the regret of the algorithm increases. We match these upper bounds with information-theoretic lower bounds that apply to any demonstrator algorithm, thereby characterizing the optimal tradeoff between exploration and reward estimation. Extensive empirical evaluations on both synthetic data and simulated experimental design data from the natural sciences corroborate our theoretical results.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Wenshuo Guo

Robust Optimization for Fairness with Noisy Protected Groups

The Stereotyping Problem in Collaboratively Filtered Recommender Systems

Multi-Source Causal Inference Using Control Variates

Optimization of financial network stability by genetic algorithm

Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits

Contact Info

Product

Resources

About