1966
DOI: 10.1093/comjnl/9.1.60
|View full text |Cite
|
Sign up to set email alerts
|

Computer Programs for Hierarchical Polythetic Classification ("Similarity Analyses")

Abstract: It is demonstrated that hierarchical classifications have computational advantages over cluster analyses. Flexible programs are outlined providing two sorting strategies and four alternative similarity-coefficients. Preliminary results suggest that for qualitative data one of the strategies and two of the coefficients are superior to the remainder of the system; the further refinements desirable for a large-scale production program are discussed.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
131
0
4

Year Published

1984
1984
2016
2016

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 332 publications
(135 citation statements)
references
References 11 publications
0
131
0
4
Order By: Relevance
“…3. Top five diversity indicators with the highest correlation: with individual qualities of timber production (a), with individual age categories (b) with the study area QTP -indicator of the quality of timber production, QTP1 -best-quality timber, QTP2, QTP3 -sawn timber of higher and lower quality, QTP4 -pulp and fuel wood, black circle -percentage of significant correlations of the diversity indicator with the quality of timber production, grey bar -average value of correlation coefficient R xy calculated from significant correlations of a particular diversity indicator, white rectangle -95% confidence interval of the average value of correlation coefficient R xy calculated from all correlations derived for a particular diversity indicator, black dagger -average value of correlation coefficient R xy calculated from all correlations derived for a particular diversity indicator, black diamond -structural indicator of diversity, BC1_SP_Rr -BC1 index of similarity (Bray, Curtis 1957) between the trees with diameter below 7 cm and the trees with diameter above 7 cm, while the species composition was derived from the growth area, BC2_SP_Hpa -BC2 index of similarity (Bray, Curtis 1957) between the trees with diameter below 7 cm and the trees with diameter above 7 cm calculated from the average tree heights of species, BP_ST_Ha -BP index of species evenness (Berger, Parker 1970) calculated from the sum of tree heights of trees with diameter above 7 cm, BUB_SP_Rr -BUB index of similarity (Baroni-Urbani, Buser 1976) between the trees with diameter below 7 cm and the trees with diameter above 7 cm, while the species composition was derived from the growth area, D_ST_Na -D index of species evenness (McIntosh 1967) of trees with diameter above 7 cm, DF1_SP_Hpa -DF1 index of similarity (Canberra distance) (Lance, Williams 1966) between the trees with diameter below 7 cm and the trees with diameter above 7 cm calculated from the average tree heights of species, E1_SP_Hr -E1 index of species evenness (Pielou 1975(Pielou , 1977 of all trees, while the species composition was derived from the sum of tree heights, E1_ST_Rr -E1 index of species evenness (Pielou 1975(Pielou , 1977 of trees with diameter above 7 cm, while the species composition was derived from the growth area, E3_SP_Hr -E3 index of species evenness (Heip 1974) of all trees, while the species composition was derived from the sum of tree heights, E3_ST_Hr -E3 index of species evenness (Heip 1974) of trees with diameter above 7 cm, while the species composition was derived from the sum of tree heights, E3_ST_Rr -E3 index of species evenness (Heip 1974) of trees with diameter above 7 cm, while the species composition was derived from the growth area, E5_SP_Hr -E5 index of species evenness (Hill 1973) of all trees, while the species composition was derived from the sum of tree heights, E5_ST_Hr -E5 index of species evenness (Hill 1973) of trees with diameter above 7 cm, while the species composition was derived from the sum of tree heights, ED1_SP_Ha …”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…3. Top five diversity indicators with the highest correlation: with individual qualities of timber production (a), with individual age categories (b) with the study area QTP -indicator of the quality of timber production, QTP1 -best-quality timber, QTP2, QTP3 -sawn timber of higher and lower quality, QTP4 -pulp and fuel wood, black circle -percentage of significant correlations of the diversity indicator with the quality of timber production, grey bar -average value of correlation coefficient R xy calculated from significant correlations of a particular diversity indicator, white rectangle -95% confidence interval of the average value of correlation coefficient R xy calculated from all correlations derived for a particular diversity indicator, black dagger -average value of correlation coefficient R xy calculated from all correlations derived for a particular diversity indicator, black diamond -structural indicator of diversity, BC1_SP_Rr -BC1 index of similarity (Bray, Curtis 1957) between the trees with diameter below 7 cm and the trees with diameter above 7 cm, while the species composition was derived from the growth area, BC2_SP_Hpa -BC2 index of similarity (Bray, Curtis 1957) between the trees with diameter below 7 cm and the trees with diameter above 7 cm calculated from the average tree heights of species, BP_ST_Ha -BP index of species evenness (Berger, Parker 1970) calculated from the sum of tree heights of trees with diameter above 7 cm, BUB_SP_Rr -BUB index of similarity (Baroni-Urbani, Buser 1976) between the trees with diameter below 7 cm and the trees with diameter above 7 cm, while the species composition was derived from the growth area, D_ST_Na -D index of species evenness (McIntosh 1967) of trees with diameter above 7 cm, DF1_SP_Hpa -DF1 index of similarity (Canberra distance) (Lance, Williams 1966) between the trees with diameter below 7 cm and the trees with diameter above 7 cm calculated from the average tree heights of species, E1_SP_Hr -E1 index of species evenness (Pielou 1975(Pielou , 1977 of all trees, while the species composition was derived from the sum of tree heights, E1_ST_Rr -E1 index of species evenness (Pielou 1975(Pielou , 1977 of trees with diameter above 7 cm, while the species composition was derived from the growth area, E3_SP_Hr -E3 index of species evenness (Heip 1974) of all trees, while the species composition was derived from the sum of tree heights, E3_ST_Hr -E3 index of species evenness (Heip 1974) of trees with diameter above 7 cm, while the species composition was derived from the sum of tree heights, E3_ST_Rr -E3 index of species evenness (Heip 1974) of trees with diameter above 7 cm, while the species composition was derived from the growth area, E5_SP_Hr -E5 index of species evenness (Hill 1973) of all trees, while the species composition was derived from the sum of tree heights, E5_ST_Hr -E5 index of species evenness (Hill 1973) of trees with diameter above 7 cm, while the species composition was derived from the sum of tree heights, ED1_SP_Ha …”
Section: Resultsmentioning
confidence: 99%
“…Biodiversity was quantified by the following basic indicators that describe species and structural diversity: (i) indices of species richness: N0 (Hill 1973), R1 (Margalef 1958), R2 (Menhinick 1964); (ii) indices of species evenness: BP (Berger, Parker 1970), E1 (Pielou 1975(Pielou , 1977, E3 (Heip 1974), E5 (Hill 1973), D (McIntosh 1967; (iii) indices of species heterogeneity: Si (Simpson 1949), H (Shannon, Weaver 1949), HB (Brillouin 1956); (iv) indices of similarity: QS (Sørensen 1948), BC (Bray, Curtis 1957), ED -Euclidian distance, BUB (Baroni-Urbani, Buser 1976), Y (Boyce 2003), DF -Canberra distance (Lance, Williams 1966), PS -proportional similarity (Czekanowski 1909); (v) other indicators: absolute and relative range of tree heights, species aggregation and mixture assessed in the field, volume of fine and coarse woody debris on a plot, number of vertical layers according to Zlatník (1976), number of shrub species, number of moss and lichen species.…”
Section: Methodsmentioning
confidence: 99%
“…We have Fourthly, we propose to adopt the abstract value of the Pearson's correlation coefficient value as the similarity measure. We have Fifthly, we propose to adopt Canberra distance [16][17][18], which is a metric function often used for data scattered around an origin. The Canberra distance is similar to the Manhattan distance and the distinction is that the absolute difference between the variables of the two objects is divided by the sum of the absolute variable values prior to summing.…”
Section: Guilt-by-association Modelmentioning
confidence: 99%
“…Although it has been stated that "the coefficient is not defined for continuously-varying numerical data" (Lance and Williams 1966), this is not relevant for polyclaves because the identification database will treat numeric ranges in most respects like other discrete categories (see below).…”
mentioning
confidence: 99%
“…The entropy H of a character takes a maximum value when the probabilities are uniformly distributed (Cover and Thomas 2006, Theorem 2.6.4), and that maximum is log n where n denotes "the number of elements in the range". This result has been stated as "maximal H… depends only on the number of states" (Abbott et al 1985, p.101), as "the base of the logarithms is an arbitrary choice" (Lance and Williams 1966), and as "if one wanted to confine H always to the range 0 to 1, then logarithms to base m can be used for characters with m states ... this is in effect just the same as multiplying H by a normalizing constant." (Pankhurst 1991, p. 192).…”
mentioning
confidence: 99%