Statistical Analysis of Maximally Similar Sets in Ecological Research


David W. Roberts




Maximally similar sets (MSSs) are sets of elements that share a neighborhood in a high-dimensional space defined by a symmetric, reflexive similarity relation. Each element of the universe is employed as the kernel of a neighborhood of a given size (number of members), and elements are added to the neighborhood in order of similarity to the current members of the set until the desired neighborhood size is achieved. The set of neighborhoods is then reduced to the set of unique, maximally similar sets by eliminating all sets that are permutations of an existing set. Subsequently, the within-MSS variability of candidate explanatory variables associated with the elements is compared to random sets of the same size to estimate the probability of obtaining variability as low as was observed. Explanatory variables can be compared for effect size by the rank order of within-MSS variability and random set variability, correcting for statistical power as necessary. The analyses performed identify constraints, as opposed to determinants, in the triangular distribution of pair-wise element similarity. In the example given here, the variability in spring temperature, summer temperature, and the growing degree days of forest vegetation sample units shows the greatest constraint on forest composition of a large set of candidate environmental variables.



How is this information collected?

This collection of Montana State authored publications is collected by the Library to highlight the achievements of Montana State researchers and more fully understand the research output of the University. They use a number of resources to pull together as complete a list as possible and understand that there may be publications that are missed. If you note the omission of a current publication or want to know more about the collection and display of this information email Leila Sterman.