Current approaches to A/B testing in networks focus on limiting interference, the concern that treatment effects can "spill over" from treatment nodes to control nodes and lead to biased causal effect estimation. Prominent methods for network experiment design rely on two-stage randomization, in which sparsely-connected clusters are identified and cluster randomization dictates the node assignment to treatment and control. Here, we show that cluster randomization does not ensure sufficient node randomization and it can lead to selection bias in which treatment and control nodes represent different populations of users. To address this problem, we propose a principled framework for network experiment design which jointly minimizes interference and selection bias. We introduce the concepts of edge spillover probability and cluster matching and demonstrate their importance for designing network A/B testing. Our experiments on a number of real-world datasets show that our proposed framework leads to significantly lower error in causal effect estimation than existing solutions.
Using the multivariate statistical methods, this study interprets a set of data containing 23 water quality parameters from 10 quality monitoring stations in Karkheh River located in southwest of Iran over 5 years. According to cluster analysis, the stations are classified into three classes of quality, and the most important factors on the whole set of parameters and each class are determined by the help of factor analysis. The results indicate the effects of natural factors, soil weathering and erosion, urban and human wastewater, agricultural and industrial wastewater on water quality at different levels and any location. Afterwards, five input selection methods such as correlation model, principal component analysis, combination of gamma test and backward regression, gamma test and genetic algorithm, and gamma test by elimination method are used for modeling BOD, and then their efficiency is investigated in simulation BOD with local linear regression, Artificial Neural Network, and genetic programming. From five methods of input variables in BOD simulation by local linear regression, genetic test and backward regression with RMSE error of 0.27 are the best input methods; gamma test based on genetic algorithm is the best model in simulation by Artificial Neural Network with RMSE error of 0.28, and finally, the gamma test model based on genetic algorithm with RMSE error of 0.1303 is the most appropriate model in simulation with genetic programming.
Although pre-trained language models, such as BERT, achieve state-of-art performance in many language understanding tasks, they have been demonstrated to inherit strong gender bias from its training data. Existing studies addressing the gender bias issue of pre-trained models, usually recollect and build genderneutral data on their own and conduct a second phase pre-training on the released pre-trained model with such data. However, given the limited size of the genderneutral data and its potential distributional mismatch with the original pre-training data, catastrophic forgetting would occur during the second-phase pre-training. Forgetting on the original training data may damage the model's downstream performance to a large margin. In this work, we first empirically show that even if the gender-neutral data for second-phase pre-training comes from the original training data, catastrophic forgetting still occurs if the size of gender-neutral data is smaller than that of original training data. Then, we propose a new method, GEnder Equality Prompt (GEEP), to improve gender fairness of pre-trained models without forgetting. GEEP learns gender-related prompts to reduce gender bias, conditioned on frozen language models. Since all pre-trained parameters are frozen, forgetting on information from the original training data can be alleviated to the most extent. Then GEEP trains new embeddings of profession names as gender equality prompts conditioned on the frozen model. This makes GEEP more effective at debiasing as well. Because gender bias from previous data embedded in profession embeddings is already removed when they are re-intialized in GEEP before second-phase pre-training starts. Empirical results show that GEEP not only achieves state-of-the-art performances on gender debiasing in various applications such as pronoun predicting and coreference resolution, but also achieves comparable results on general downstream tasks such as GLUE with original pre-trained models without much forgetting.
This article discusses the value of sustainable oil maintenance by using submicron high-tech offline filtration based on oil condition monitoring to reduce wear particles and improve system reliability. These activities incorporate broader goals of establishing a proactive maintenance approach, defined as continuous monitoring and controlling of the root causes of machine failure. Among these causes, contamination is the industry's most severe, popular, and most recognized failure cause. In order to ensure all necessary proactive maintenance activities, this paper uses practical analysis for oil contamination monitoring based on ISO 4405. The author applied a 0.1-micron high-tech offline filtration for lubrication systems of ball mills in the Sungun copper mine as an oil maintenance activity to control the system lubrication contamination as a root cause of failures.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.