An iterative outlier elimination procedure based on hypothesis testing, commonly known as Iterative Data Snooping (IDS) among geodesists, is often used for the quality control of modern measurement systems in geodesy and surveying. The test statistic associated with IDS is the extreme normalised least-squares residual. It is well-known in the literature that critical values (quantile values) of such a test statistic cannot be derived from well-known test distributions but must be computed numerically by means of Monte Carlo. This paper provides the first results on the Monte Carlo-based critical value inserted into different scenarios of correlation between outlier statistics. From the Monte Carlo evaluation, we compute the probabilities of correct identification, missed detection, wrong exclusion, over-identifications and statistical overlap associated with IDS in the presence of a single outlier. On the basis of such probability levels, we obtain the Minimal Detectable Bias (MDB) and Minimal Identifiable Bias (MIB) for cases in which IDS is in play. The MDB and MIB are sensitivity indicators for outlier detection and identification, respectively. The results show that there are circumstances in which the larger the Type I decision error (smaller critical value), the higher the rates of outlier detection but the lower the rates of outlier identification. In such a case, the larger the Type I Error, the larger the ratio between the MIB and MDB. We also highlight that an outlier becomes identifiable when the contributions of the measures to the wrong exclusion rate decline simultaneously. In this case, we verify that the effect of the correlation between outlier statistics on the wrong exclusion rate becomes insignificant for a certain outlier magnitude, which increases the probability of identification.
Abstract:We present a numerical simulation method for designing geodetic networks. The quality criterion considered is based on the power of the test of data snooping testing procedure. This criterion expresses the probability of the data snooping to identify correctly an outlier. In general, the power of the test is defined theoretically. However, with the advent of the fast computers and large data storage systems, it can be estimated using numerical simulation. Here, the number of experiments in which the data snooping procedure identifies the outlier correctly is counted using Monte Carlos simulations. If the network configuration does not meet the reliability criterion at some part, then it can be improved by adding required observation to the surveying plan. The method does not use real observations. Thus, it depends on the geometrical configuration of the network; the uncertainty of the observations; and the size of outlier. The proposed method is demonstrated by practical application of one simulated leveling network. Results showed the needs of five additional observations between adjacent stations. The addition of these new observations improved the internal reliability of approximately 18%. Therefore, the final designed network must be able to identify and resist against the undetectable outliers -according to the probability levels.Keywords: Reliability, Outlier, Numerical Simulation, Monte Carlo, Design of geodetic networks.
Resumo:Nós apresentamos um método de simulação numérica para planejamento de redes geodésicas. O critério de qualidade considerado é baseado no poder do teste do procedimento estatístico data snooping. Esse critério expressa a probabilidade do procedimento data snooping de identificar corretamente um outlier. Em geral, o valor do poder do teste é sempre desconhecido na prática (definido na teoria). Porém, com o advento de computadores mais rápidos e sistemas de armazenamento de dados de maior capacidade, o poder do teste pode ser estimado por meio de simulações numéricas. O número de experimentos que o procedimento data snooping identifica corretamente um outlier é quantificado por meio do método Monte Carlo de simulação. Se a configuração inicial da rede não atender o critério de confiabilidade, então uma nova observação é adicionada no projeto da rede. O método não faz uso de observações reais coletadas em campo, dependendo apenas da configuração geométrica da rede, das incertezas das observações e da magnitude dos outliers. A eficiência do método é verificada por meio de um exemplo numérico de uma rede de nivelamento geométrico simulada. Os resultados mostraram a necessidade de cinco observações adicionais entre os pontos adjacentes. A adição destas novas observações melhorou a confiabilidade interna da rede de aproximadamente 18%. Portanto, a rede é projetada de modo que seja capaz de identificar outliers e ainda resistir a outilers não identificados nas observações -de acordo com os níveis de probabilidade.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.