Categories
Nevin Manimala Statistics

Character correlation and its use for identification

Zootaxa. 2021 Nov 22;5071(1):131-142. doi: 10.11646/zootaxa.5071.1.7.

ABSTRACT

A method is presented for correlating phylogenetic characters through cladistic analysis. It extends the use of phylogenetic datasets for diagnostic purposes. It improves matrix-based identification tools by predicting novel character-state combinations that were not observed when the key was constructed. By interpreting homoplasy as analytical error, hypothetical character-state combinations are tested for the homoplasy that they would add to the shortest tree(s). The correlation is equal to the homoplasy summed across all state combinations, divided by a maximum possible value. The results depend on uncertainty about the sequence of state transitions and their overlap among characters. A correlation index r is proposed for sets of non-additive characters; it is a kind of multiple-regression value, and its ensemble value R is a statistic of a whole matrix. This approach can be used to select sets of the best “proxy” characters to substitute for unobservable characters of interest. The concept can be extended to continuous characters. Worked examples are given with datasets of various insect orders.

PMID:34810677 | DOI:10.11646/zootaxa.5071.1.7

By Nevin Manimala

Portfolio Website for Nevin Manimala