J Math Imaging Vis (2009) 33: 39–65 DOI 10.1007/s10851-008-0104-3
Probabilistic Models for Shapes as Continuous Curves
Jeong-Gyoo Kim · J. Alison Noble · J. Michael Brady
Published online: 23 July 2008 © Springer Science+Business Media, LLC 2008
Abstract We develop new shape models by defining a stan- 1 Introduction dard shape from which we can explain shape deformation and variability. Currently, planar shapes are modelled using Our work contributes to shape analysis, particularly in med- a function space, which is applied to data extracted from ical image analysis, by defining a standard representation images. We regard a shape as a continuous curve and identi- of shape. Specifically we develop new mathematical mod- fied on the Wiener measure space whereas previous methods els of planar shapes. Our aim is to develop a standard rep- have primarily used sparse sets of landmarks expressed in a resentation of anatomical or biological shapes that can be Euclidean space. The average of a sample set of shapes is used to explain shape variations, whether in normal subjects defined using measurable functions which treat the Wiener or abnormalities due, for example, to disease. In addition, measure as varying Gaussians. Various types of invariance we propose a quasi-score that measures global deformation of our formulation of an average are examined in regard to and provides a generic way to compare statistical methods practical applications of it. The average is examined with of shape analysis. relation to a Fréchet mean in order to establish its valid- As D’Arcy Thompson pointed in [50], there is an impor- ity. In contrast to a Fréchet mean, however, the average al- tant relationship between shape of a biological structure and ways exists and is unique in the Wiener space. We show that its function. To understand the identity, structure and func- the average lies within the range of deformations present in tion of anatomical objects non-invasively, one must study the sample set. In addition, a measurement, which we call a their shape as it appears in images. One of the principal quasi-score, is defined in order to evaluate “averages” com- puted by different shape methods, and to measure the over- problems in medical image analysis is the description of all deformation in a sample set of shapes. We show that the shapes in a way that represents biological variability. This is average defined within our model has the least spread com- often termed shape variation or deformation. Evidently, de- pared with methods based on eigenstructure. We also derive formation has to be explained in terms of correspondences a model to compactly express shape variation which com- between shapes. prises the average generated from our model. Some exam- There has been considerable progress in shape analysis ples of average shape and deformation are presented using during the last two decades. Shape has often been defined by well-known datasets and we compare our model to previous a set of landmarks, significant points on a shape, and which work. are usually selected manually. For this reason, sparse set of landmarks tend to be used. We define a shape space as an Keywords Shape space · Average over a function space · underlying set of shape representations. The representation Wiener measure space · Fréchet mean chosen for a shape space is important, because manipula- tions of shape are determined in large part by how shape in- formation is represented. Several authors have provided pre- · · J.-G. Kim ( ) J.A. Noble J.M. Brady cise definitions of shape and shape space. These include: dif- Dept. Engineering Science, Oxford University, Oxford OX1 3PJ, UK ferential manifolds [4, 12, 13, 20, 27, 33], eigenstructures, e-mail: [email protected] [6, 7], and function spaces [3, 22]. 40 J Math Imaging Vis (2009) 33: 39–65 Kendall has defined shape as “what is left when the dif- ferential manifold of the descriptors. They use eigenstruc- ferences which can be attributed to translations, rotations, ture on the differential manifolds to capture shape variabil- and dilations have been quotiented out” [20]. Kendall’s ap- ity. On the other hand, Klassen et al. [22] built another type proach, using a Procrustean metric, has been a cornerstone of differential manifold on L2. In their work, a single para- for a great deal of shape related research, and adopted in meter, such as a direction function or the curvature of con- many applications, both theoretically and practically. tours, is described on Hilbert manifold. They used a mean in In Kendall’s approach, shapes characterised by land- terms of Karcher [18], in fact a Frechét mean [14]. In their marks form a Riemannian manifold with the Procrustean shape space, analytic expressions for geodesics are not pre- metric; a shape is represented by a point on a sphere. The sented so that their model is difficult to be practical. This Procrustes Analysis that Kendall [20] employed on a Rie- is a common limitation from which differential geometric mannian manifold has been used for Euclidean space in nu- approaches suffer. merous methods. On the other hand, Bookstein [4] consid- Sparr [42–44] formulated both a shape representation and ers triangular shapes (characterised by three points) which a basis for the shape space for a finite number of points, form a differential manifold, a sphere but with a differ- a polyhedron. The shape space is designed to be invariant ent metric from Kendall’s; the Poincaré plane. Both shape with respect to affine transformations, so is named the Affine spaces have been developed and statistics of the result- Shape. Berthilsson and Åström [3] extended Sparr’s idea for ing spaces have been investigated by many researchers, shapes represented by finitely many points to shapes repre- sometimes separately [24, 26, 27], sometimes comparatively sented by continuous curves. [10, 28, 29, 41]. Pennec et al. [33, 34] regards shapes as We contend that methods using sparse sets of landmarks a combination of a feature (such as a point or curve) and are suitable only for shapes that can be characterised by a a transformation (rigid-body). Both the feature set and the small number of landmarks. Some geometric shapes have transformation set constitute differential manifolds, respec- evident landmarks that are sufficient to characterise them. tively, with invariant metrics. Very recently and indepen- For example, the planar shape of 3 pyramids in an aerial dently of our own work, Pennec [32] studies statistics on view in Fig. 1(left) is perfectly captured by the four land- Riemannian manifolds, focusing on Fréchet means. marks shown on each of them. However, this approach is of Cootes et al. [6, 7] use generalised Procrustes analysis questionable relevance for anatomical shapes, such as that [16] and proposed a shape model in terms of an eigenspace showninFig.1 (right), which tend to have very few distinc- by using Principal Component Analysis (PCA). In this ap- tive landmarks. It is precisely this kind of shape that is the proach, a shape is represented by the eigenvectors of the co- focus of this paper. variance matrix of the locations of landmarks, with weights There are widely recognised fundamental problems in as parameters. The weights express shape variations un- methods that depend on sparse sets of landmarks. First, lo- der the assumption that these are independent and follow cating landmarks on images is not only time-consuming; a Gaussian distribution. This approach results in a huge re- but also, especially for noisy medical images, often requires duction of the dimensionality of their shape space. For this expert knowledge. Second, landmarks are not only used to reason, the method is computationally efficient and easy to characterise a particular shape; but are also used to match apply and test. It has been widely applied, especially in med- corresponding points over the whole sample set of shapes. In ical image analysis [2, 8, 9, 53]. methods that use a sparse set of landmarks, mis-represented Fletcher et al. [12, 13] represented shapes in terms of shapes can lead to insufficient or false analysis. For this rea- medial axis descriptions and developed a model on a dif- son, recent work has tended to use denser sets of landmarks.
Fig. 1 Landmarking annotated by ∗: planar shapes of 3 pyramids (left) and a femoral head (right) J Math Imaging Vis (2009) 33: 39–65 41
Fig. 2 Femurs: (left) each femoral shape consists of 31 landmarks; (right) every 5 point are marked by + and their average by ∗; the average of others by ·
However, the fundamental problems persist, since the same istence and uniqueness are not guaranteed. There has been techniques are applied. Also, to overcome the first of these a study of its existence [18] under quite restrictive assump- difficulties, there have been attempts to automate landmark- tions and its uniqueness, with even more assumptions [26]; ing [8, 9, 23, 49] but to date these techniques are limited in but both are limited to a narrow set of a structures in Rie- accuracy and the correspondence problem remains. mannian manifolds. Due to the lack of existence and unique- Methods of shape analysis increasingly tend to be devel- ness of a Fréchet mean, one needs more than one term for oped within a statistical framework. In such methods, land- a mean and has to distinguish them as in [28] (a mean of marks are labelled points (locations) as shown in Fig. 1. shapes, the shape of a mean) and [33] (a mathematical mean, After being filtered by Procrustes analysis, corresponding empirical mean). landmarks are grouped according to a set of labels. That is, a set of corresponding landmarks across all shapes of a 1.1 The proposed model sample set is formed for each label. For each label, an aver- age location is estimated, usually assuming that the sample is uniformly distributed. Then the shape configured by the Our model developed in Sect. 4 mainly focuses on solv- collection of these average locations corresponding to each ing the problems outlined above. First, we contend that the label is called the “average” shape. Indeed, most methods mathematical precision of the definition of shape, and the employing an average shape assume a uniform distribution1 numerical measurement of shape variability, remain inade- over shapes, regardless of the structure of their shape spaces. quate. Since our primary interest is in shapes which tend to Such methods are known to lead to average shapes which are have few evident landmarks, we prefer to regard a planar not representative of actual shapes (see for example Fig. 2), shape as a continuous curve, and this leads to the construc- or which deviates from the majority of the population of a tion of a shape space that is totally different from previous sample set [7]. This is particularly the case for shapes hav- methods. ing no, or only a few, evident landmarks. We have observed Very recently, researchers recognise a shape should be that the average with a uniform distribution does not ade- defined as continuous functions and there are methods quately reflect the extent of a deformation found in a sample where shape is viewed in terms of continuous contours set, particularly when the deformation is large. [3, 22], but no one has suggested concrete structure on Other methods, not necessarily using a distribution, adopt continuous functions to explain deformation. For example, the approach of a Fréchet mean: a minimiser of a variance- Berthilsson and Åström [3] present a precise definition of like functional defined on a metric space [14]. A fundamen- the shape space but concentrate on making it affine invari- tal problem with the use of a Fréchet mean is that its ex- ant;Klassenetal.[22] use a Fréchet mean on a Hilbert mani- fold. We are not aware of methods for generating a standard 1 representation of shapes expressed in terms of continuous An average shape of n shapes having m landmarks is the collection of ¯1 ¯m ¯i = n i i curves. Conventional methods of building a shape space of points (x ,...,x ),wherex j=1 xj /n,andxj is the i-th land- mark of a shape j. continuous functions mix up with the structure of a sparse 42 J Math Imaging Vis (2009) 33: 39–65 set of landmarks, probably due to the difficulty of manipu- 2 Mathematical Background lating a complicate infinite-dimensional space as pointed in [31] and finding a distribution and an average. Hence, they We regard univariate functions as paths in Brownian motion lack structural consistency in our view. on the Euclidean space R2. Primarily to establish notation, In the following, we concentrate on how to formalise we briefly introduce the Wiener measure space, measurable normal shape, and how to describe and to measure shape functions on it, and metrics on the Wiener space. We mostly changes due to disease or growth over time. Apart from this follow notation in [17] where a compact description of the primary consideration, we try to develop our model so that Wiener measure space may be found. it does not suffer from correspondence problems, but to date we have found it difficult to judge this objectively. 2.1 The Wiener measure space We are using a well-known function space for the shape space of the proposed model. The shape space for our model The Wiener measure space originates from Brownian mo- C0[a,b], where [a,b] is a closed bounded interval of real tion, that is the motion of small particles suspended in a numbers, is chosen for the following reasons: fluid. Einstein used kinetic theory to derive the diffusion equation ∂ρ = Dρ for such motions in terms of funda- • It is a metric space with the L2-norm which is sensible for ∂t mental parameters of the particles and liquid, where D is a practical use; there is no difficulty in defining a metric as diffusion constant. The normalised one-dimensional version in shape models built on differential manifolds. of Einstein’s probabilistic formula is • We define a shape by its continuous boundary; the contour is represented by a continuous function, a member of the Prob({α