OptimClass: Using species-to-cluster fidelity to determine the optimal partition in classification of ecological communities
Authors | |
---|---|
Year of publication | 2010 |
Type | Article in Periodical |
Magazine / Source | Journal of Vegetation Science |
MU Faculty or unit | |
Citation | |
web | http://www3.interscience.wiley.com/journal/123193144/abstract |
Field | Ecology |
Keywords | Cluster analysis; Cover transformation; Dendrogram; Optimal number of clusters; Ordinal clustering; Resemblance measures; Stopping rules; TWINSPAN |
Description | Question: Community ecologists are often confronted with multiple possible partitions of a single set of records of species composition and/or abundances from several sites. Different methods of numerical classification produce different results, and the question is which of them, and how many clusters, should be selected for interpretation. We demonstrate a new method for identifying the optimal partition from a series of partitions of the same set of sites, based on number of species with high fidelity to clusters in a partition (faithful species). Methods: The new method, OptimClass, has two variants. OptimClass 1 searches the partition with the maximum number of faithful species across all clusters, while OptimClass 2 searches the partition with the maximum number of clusters that contain at least a preselected minimum number of faithful species. Faithful species are determined based on the P value of the Fisher's exact test, as a measure of fidelity. OptimClass was tested on three vegetation datasets that varied in species richness and internal heterogeneity, using several classification algorithms, resemblance measures and cover transformations. Results: Results from both variants of OptimClass depended on the preselected threshold P value for faithful species: higher P gave higher probability that a partition with more clusters was selected as optimal. Good partitions, in terms of OptimClass criteria, involved flexible beta clustering, and also ordinal clustering. Good partitions were also obtained with TWINSPAN when the required number of clusters was small, or UPGMA when the required number of clusters was large. Poor partitions usually resulted from classifications that used resemblance measures and cover transformations emphasizing differences in species cover; this is not unexpected because OptimClass uses a presence/absence-based fidelity measure. Conclusions: If the aim of a classification is to obtain clusters rich in faithful species, which can be subsequently used as diagnostic species for identification of community types, OptimClass is a suitable method for simultaneous choice of the optimal classification algorithm and optimal number of clusters. It can be computed in the JUICE program. |
Related projects: |