2001
DOI: 10.1198/004017001316975907
|View full text |Cite
|
Sign up to set email alerts
|

Using Angles to Identify Concentrated Multivariate Outliers

Abstract: This anicle desc ribes a procedure for the detection of multivariate oU lliers based on the anal ysis 01' certain angular properties 01' the observations. The method is simple. exploratory in nature. and particularly well suited for the detection of concen trated contamination paltems, in which the outl iers appear 10 foml a cluster, separated from ¡he sample. II is shown that it presents good properties for the identification of conta minations on high-dimensional sample spaces and for high contaminati on lev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
16
0

Year Published

2001
2001
2012
2012

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 21 publications
(16 citation statements)
references
References 13 publications
0
16
0
Order By: Relevance
“…The performance of these methods is documented in the article under discussion. Another attempt in this direction was given by Juan and Prieto (2001), who tried to nd outliers by looking at the angles subtended by clusters of points projected on an ellipsoid. They showed reasonable performance of this method but provided computations only for very concentrated outliers (‹ D 001).…”
Section: A3 An Expression For ƒ Zmentioning
confidence: 99%
See 1 more Smart Citation
“…The performance of these methods is documented in the article under discussion. Another attempt in this direction was given by Juan and Prieto (2001), who tried to nd outliers by looking at the angles subtended by clusters of points projected on an ellipsoid. They showed reasonable performance of this method but provided computations only for very concentrated outliers (‹ D 001).…”
Section: A3 An Expression For ƒ Zmentioning
confidence: 99%
“…The greatest chance of success comes from use of multiple methods, at least one of which is a general-purpose method such as FAST-MCD and MULTOUT, and at least one of which is meant for clustered outliers, such as kurtosis1, the angle method of Juan and Prieto (2001), or our clustering method (Rocke and Woodruff 2001).…”
Section: A3 An Expression For ƒ Zmentioning
confidence: 99%
“…Requirements for such detectors include a low input-output processing time, low computational time, and minimal number of user-specified parameters. Several types of anomaly detectors that have been considered for real-time implementation are based on linear mixing models [3][4][5][6][7][8], kernel functions [3,9,10], robust distance [11][12][13], angle [14,15], and statistical distance [1,[16][17][18].…”
Section: Introductionmentioning
confidence: 99%
“…Requirements for such detectors include a low input-output processing time, low computational time, and minimal number of user-specified parameters. Several types of anomaly detectors that have been considered for real-time implementation are based on linear mixing models [3-8], kernel functions [3, 9, 10], robust distance [11][12][13], angle [14,15], and statistical distance [1,[16][17][18].One type of anomaly detectors based on linear mixing models requires prior knowledge of the spectra of the end members. One of the difficulties with this type of anomaly detectors is that the spectra of the end members for the complex background are contaminated with background spectra and do not really resemble the actual background spectra.…”
mentioning
confidence: 99%
“…It is well known that a few outliers in the data may arbitrarily distort the sample mean and the sample covariance matrix, therefore, the robust estimation of location and shape is a crucial problem in multivariate statistics. Several robust estimates have been proposed, see Gnanadesikan and Kettenring (1972), Maronna (1976), Stahel (1981), Donoho (1982), Rousseeuw (1985), Davies (1987), Rousseeuw and van Zomeren (1990), Tyler (1991Tyler ( , 1994, Hadi (1992), Cook, Hawkins, and Weisberg (1993), Rocke andWoodruff (1993, 1996), Atkinson (1994), Hawkins (1994), Maronna and Yohai (1995), Agulló (1996), Rousseeuw and van Driessen (1999), Becker and Gather (2001), Peña and Prieto (2001a), Juan and Prieto (2001), Hawkins and Olive (2002), and Maronna and Zamar (2002) and the references therein. For high-dimensional large datasets a useful way to avoid the curse of dimensionality in data mining applications is to search for outliers in univariate projections of the data.…”
Section: Introductionmentioning
confidence: 99%