bioRxiv preprint doi: https://doi.org/10.1101/2020.07.23.218289; this version posted January 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. F. R. O’Keefe
1 Shape Dimensionality Metrics for Landmark Data
2
3 F. Robin O'Keefe1*
4
5 Marshall University, Biological Sciences, Huntington, WV
6
7 *corresponding author: F. Robin O’Keefe, Professor, Marshall University, College of
8 Science 265, One John Marshall Drive, Hunington, WV 25755. Phone: +1 304 696 2427.
9 Email: [email protected]
10
11 Running Head: Whole Shape Integration Metrics
12
13 Keywords: Dire wolf, Canis dirus, geometric morphometrics, modularity and integration,
14 information entropy, effective rank, effective dispersion, latent dispersion.
15
1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.23.218289; this version posted January 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. WHOLE SHAPE INTEGRATION METRICS
16 ABSTRACT
17 The primary goal of this paper is to examine and rationalize different integration metrics
18 used in geometric morphometrics, in an attempt to arrive at a common basis for the
19 characterization of phenotypic covariance in landmark data. We begin with a model
20 system; two populations of Pleistocene dire wolves from Rancho La Brea that we
21 examine from a data-analytic perspective to produce candidate models of integration. We
22 then test these integration models using the appropriate statistics and extend this
23 characterization to measures of whole-shape integration. We demonstrate that current
24 measures of whole-shape integration fail to capture differences in the strength and pattern
25 of integration. We trace this failure to the fact that current whole-shape integration
26 metrics purport to measure only the pattern of inter-trait covariance, while ignoring the
27 dimensionality across which trait variance is distributed. We suggest a modification to
28 current metrics based on consideration of the Shannon, or information, entropy, and
29 demonstrate that this metric successfully describes differences in whole shape integration
30 patterns. Finally, the information entropy approach allows comparison of whole shape
31 integration in a dense semilandmark environments, and we demonstrate that the metric
32 introduced here allows comparison of shape spaces that differ arbitrarily in their
33 dimensionality and landmark membership.
34
2 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.23.218289; this version posted January 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. F. R. O’Keefe
35 The study of integration in biological systems has a long history, stretching back to
36 Darwin (1859), given a modern footing by Olson and Miller (1958), quantified by
37 Cheverud (1982, 1996), and maturing into a broad topic of modern inquiry encompassing
38 genetics, development, and the phenotype (see Klingenberg, 2013, and Goswami and
39 Polly, 2010, for reviews). The concept of ‘integration’ means that the traits of a
40 biological whole are tightly dependent. This dependence is critical in evolving
41 populations because traits are not free to respond to selection without impacting
42 dependent traits, and the directionality of selection response is constrained by these
43 dependencies (Grabowski and Porto, 2017, Figure 1). Yet trait dependency can also
44 remove constraint by giving a population access to novel areas of adaptive space
45 (Goswami et al., 2014, Figure 5). Consequently the integration of traits is central to the
46 evolvability of biological systems, accounting for the intense research scrutiny it has
47 received. This work has produced a family of metrics that summarize and characterize
48 covariance patterns in quantitative morphological traits (Pavlicev et al, 2009; Goswami
49 and Polly, 2010), each with its own mathematical treatment and simplifying assumptions.
50
51 Background
52 “The diagonalization of a matrix is the way in which the space described by the matrix
53 can be conveniently summarized, and the properties of the matrix determined.”—Blows,
54 2006, p.2.
55 The use of landmark data to characterize the shape of biological structure, and to quantify
56 shape change, has become ubiquitous since the introduction of geometric morphometrics
57 by Bookstein (1997; reviewed in Klingenberg, 2010). All geometric morphometric
3 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.23.218289; this version posted January 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. WHOLE SHAPE INTEGRATION METRICS
58 techniques begin with a matrix LM of landmark data comprising spatial coordinates of
59 homologous points on a series of specimens, that may be of two or three dimensions, so
60 that the elements of the variable vector V are:
61