SPECIAL ISSUE CSIRO PUBLISHING Marine and Freshwater Research, 2016, 67, 1059–1071 http://dx.doi.org/10.1071/MF15069
New parameterisation method for three-dimensional otolith surface images
P. Marti-PuigA,C, J. Dane´sA, A. ManjabacasB and A. LombarteB
AGrup de Tractament de Dades i Senyals, University of Vic (UVIC-UCC), Sagrada Famı´lia, 7, E-08500 Vic, Catalonia, Spain. BInstitut de Cie`ncies del Mar, ICM (CSIC), Passeig Marı´tim de la Barceloneta, 37-49, E-08003 Barcelona, Catalonia, Spain. CCorresponding author. Email: [email protected]
Abstract. The three-dimensional (3-D) otolith shapes recently included in the Ana`lisi de FORmes d’Oto`lits (AFORO) database are defined by means of clouds of points across their surfaces. Automatic retrieval and classification of natural objects from 3-D databases becomes a difficult and time-consuming task when the number of elements in the database becomes large. In order to simplify that task we propose a new method for compacting data from 3-D shapes. The new method has two main steps. The first is a subsampling process, the result of which can always be interpreted as a closed curve in the 3-D space by considering the retained points in an appropriate order. The subsampling preserves morphological information, but greatly reduces the number of points required to represent the shape. The second step treats the coordinates of the 3-D closed curves as periodic functions. Therefore, Fourier expansions can be applied to each coordinate, producing more information compression into a reduced set of points. The method can reach very high information compression factors. It also allows reconstruction of the 3-D points resulting from the subsampling process in the first step. This parameterisation method is able to capture 3-D information relevant to classification of fish species from their otoliths, providing a greater percentage of correctly classified specimens compared with the previous two-dimensional analysis.
Additional keywords: fast Fourier transforms, feature extraction, shape analysis, three-dimensional contour descriptors, three-dimensional shape parameterisation.
Received 20 February 2015, accepted 31 July 2015, published online 26 October 2015
Introduction ecomorphological comparisons (Cruz and Lombarte 2004; Fish otolith morphology has been widely used in the identifi- Lombarte and Cruz 2007; Sadighzadeh et al. 2014b), archaeo- cation of stocks (Bird et al. 1986) and species (Schmidt 1969), logical studies (Harrison 2009) and, most especially, prey phylogenetic reconstructions (Gaemers 1983), paleontological identifications (Veiga et al. 2011; Neves et al. 2012; Ota´lora- studies (Nolf 1985), food web analyses (Fitch and Brownell Ardila et al. 2013; Rosas-Luis et al. 2014). 1968), ecomorphological work (Lombarte and Fortun˜o 1992) Attempts to develop three-dimensional (3-D) morphological and even sex identification and age estimation (Cardinale et al. studies of otoliths started with the use of X-ray microtomogra- 2004; Doering-Arjes et al. 2008). In all cases, adequate and phies (Hamrin et al. 1999) for ageing fish and were followed complete information about otolith morphology is required. The by anatomical studies of the inner ear (Ramcharitar et al. Ana`lisi de FORmes d’Oto`lits (AFORO) website (http://aforo. 2004; Schulz-Mirbach et al. 2011, 2013; Ke´ver et al. 2014). cmima.csic.es, accessed 22 August 2015) is an open online We are engaged in a Spanish National Research Program project catalogue of otolith images. It offers image analysis for auto- titled AFORO3-D, the main objective of which is to create 3-D matic identification of fish species and populations using two- morphological descriptors for whole otoliths and to provide dimensional (2-D) closed contours representing otolith shape more effective analysis and classification tools. The possibility (Lombarte et al. 2006). of obtaining 3-D pictures will offer more reliability for the At present, the AFORO database contains more than 4600 identification of otoliths by including external side and internal images from 1440 Teleostean species, and its software tools side (sulcus acusticus) curvature descriptions with high taxo- have been used to generate an otolith atlas (Tuset et al. 2008; nomic value (Smale et al. 1995; Torres et al. 2000; Campana Sadighzadeh et al. 2012), morphological indices (Tuset et al. 2004; Tuset et al. 2008; Sadighzadeh et al. 2012). Most 2006), stock identifications (Capoccioni et al. 2011; Tuset et al. importantly, the software we develop will allow production 2013; Sadighzadeh et al. 2014a), automatic species identifica- and study of virtual 3-D models without having the otoliths tions (Parisi-Baradad et al. 2005, 2010; Tuset et al. 2012), actually in hand (Gauldie 1988; Popper and Fay 1993;
Journal compilation CSIRO 2016 Open Access www.publish.csiro.au/journals/mfr 1060 Marine and Freshwater Research P. Marti-Puig et al.
Ϫ R 1 ϩ1
z^ θ 1.0
0.5
ϕ y^ 0 Ϫ1.0 x^ Ϫ Ϫ0.5 0.5 0 Ϫ1.0 0.5 Ϫ1.0 Ϫ 0.5 0 1.0 0.5 1.0
Fig. 1. Representation of the left sagittal otolith position of Argyrosomus regius according to the Cartesian coordinates of the scanning protocol.
Schulz-Mirbach et al. 2013). To that end, the University of Girona Umbrina cirrosa) were digitised, following a protocol similar to (Girona, Spain; Computer Vision and Robotics Research Group, that used to capture the images in AFORO (Lombarte et al. ViCOROB) and AQSENSE Co. (see http://www.aqsense.com/, 2006; Tuset et al. 2008) and also used by palaeontologists accessed 27 August 2015) have developed a purpose-built, single- (Gaemers 1983; Nolf 1985; Schwarzhans 1996; Reichenbacher camera, 3-D scanner prototype based on projected light-emitting et al. 2007). Thus, all specimens share the same positions and diode (LED) light capable of digitising the shapes of otoliths orientations in the Cartesian coordinates after being digitised. ranging in size from 1 mm to a few centimetres. The objective of In biological terms, the x-axis corresponds to the rostrum– the present study was to create 3-D morphological descriptors of postrostrum axis of the otolith. The y-axis corresponds to the whole otoliths for analysis, automatic classification and automat- otolith’s dorso–ventral axis. The z-axis is placed midway ic retrieval from an image database. Introducing a shape into a between the internal and external side extremes of the otolith computer system in order to find the most similar shape already in and represents the width of the sagitta. Application of that the database or to fit a new 3-D natural object into a classification protocol makes it unnecessary to determine descriptors invariant by comparing it with all the shapes in the database is very time to rotations. The only invariance we need to consider with regard consuming because of the huge amount of information involved to scale, and that normalisation requires a scale factor. The for each object; it becomes progressively more difficult as the size otolith shape is scaled so that its length along the y-axis is 2 units of the database increases. (Fig. 1); therefore, for all the 3-D otolith shapes, this axis in our Our study of 3-D methods has focussed on otoliths of the coordinate system extends from the point (0,–1,0) to (0,1,0). family Scienidae (Percifomes). The sciaenids (drums or croa- The otolith digitisation process involves scanning all otolith kers) are characterised by their specialised acoustic communi- sides and the partial scans are joined to represent the entire 3-D cation. Their ability to produce sounds has long been known, as object. The final result is a file containing a cloud of points their English common names suggest, and it is part of their representing the otolith’s shape. Specialised software and reproductive behaviour (Luczkovich et al. 1999; Ramcharitar a technical expert are necessary for these tasks, so they are et al. 2001). Their sagittae are characterised by their relatively expensive and time consuming. Furthermore, the scanner sys- large size (Cruz and Lombarte 2004), compared to the average tem is limited by otolith size and only big sagittae, like those of size of fish otoliths, with strong development of the external side the sciaenids, provide optimal results. The cloud of points (Nolf 2013; Ramcharitar et al. 2004) and a specialised sulcus representing an otolith surface can be seen in Fig. 2. acusticus (Volpedo and Echevarrı´a 2000; Monteiro et al. 2005; Ramcharitar et al. 2006; Tuset et al. 2008; Lin and Chang 2012). Surface parameterisation method The proposed method can be broken down into two parts. The Material and methods first consists of a subsampling process that generates a set of Scanning protocol points that can always be interpreted as on a closed curve in the In order to obtain 3-D otolith surface images, samples from three 3-D space. This process preserves important morphological species of Scienidae (Argyrosomus regius, Sciaena umbra and information, but the number of points is markedly reduced. Otolith 3-D parameterisation Marine and Freshwater Research 1061
0.4 0.3 0.2 0.1 0 -axis
z Ϫ0.1 Ϫ0.2 Ϫ0.3 Ϫ0.4
0.5
x-axis 0 Ϫ 0.5 Ϫ Ϫ Ϫ Ϫ1.0 0 Ϫ0.2 0.4 0.6 0.8 1.0 0.8 0.6 0.4 0.2 y-axis
Fig. 2. Cloud-of-points on an otolith surface represented in Cartesian coordinates after its digitisation according to the scanning protocol. The otolith represented belongs to Umbrina cirrosa.
z
1.0 0.8 0.6 y x 0.4 0.2 0 Ϫ0.2 -axis
z Ϫ0.4 Ϫ0.6 Ϫ0.8 Ϫ1.0 1.0 0.5 1.0 y 0 0.5 -axis 0 Ϫ0.5 Ϫ0.5 Ϫ -axis 1.0 Ϫ1.0 x
Fig. 3. The left-hand panel shows the auxiliary curves forward (from 1 to 1 on the y-axis, 1 on the z-axis) and backward (from 1 to 1 on the y-axis, 1 on the z-axis) used to guide the subsampling process. In this example, the number of loops L ¼ 3. The right-hand panel shows a representation of the first 38 auxiliary straight lines.
The second part consists of computing the discrete Fourier points (cxn,cyn,czn) and is shown in green on the left in Fig. 3. transform (DFT) on each of the 3-D coordinates. Periodic Those points are obtained using the following equations: functions, such as those obtained from closed curves, can be 4 expanded into a Fourier decomposition. This step achieves cyn ¼ 1 þ n P P greater information compression by selecting a reduced set of p cxn ¼ A sinðL cynÞ n ¼ 0; 1; 2; :::; 1 ð1Þ the Fourier coefficients to represent the shape. 2 czn ¼ A cosðLp cynÞ Subsampling process The points of the backward path are represented in red on the The subsampling process is driven by an auxiliary curve com- right in Fig. 3 and are obtained using the following equations: posed of two paths (forward and backward) that depend on two main parameters, P and L, selected for a particular appli- 4 p cyn ¼ 1 n P 2 P P cation: P is the even number of points retained in the subsample p cxn ¼ A sinðL cynÞ n ¼ ; þ 1; :::; P 1 ð2Þ and L is the number of loops that each of the two paths takes. 2 2 The forward and backward paths of the auxiliary curve that czn ¼ A cosðLp cynÞ guide the subsampling process are shown on the left side of Fig. 3. Each point on the auxiliary curve corresponds to a point The constant A can be taken as 1 (in practice, a larger number on the otolith surface. The forward path is described by half the is used). Note that by construction the coordinate y goes from 1 1062 Marine and Freshwater Research P. Marti-Puig et al. to 1 and comes back from 1 to 1 in regular steps. So, the line is stopped with several points inside and the point of interest proposed subsampling process occurs by increments and decre- is computed from them. With the original 3-D high-resolution ments (forward and backward paths) along the y-axis, following otolith shapes used in our work, all these approaches perform the cyn points. On the right side of Fig. 3, there is a partial very similarly. Fig. 5 is a representation of the normalised otolith representation of the lines that go from the centre of the otolith cloud of points together with the auxiliary curve that guides the mass to the points on the auxiliary curves. subsampling. For each of the auxiliary curve points, a set of points is The subsampling process finishes once all the P points of the selected on the surface that belongs to the interval cyn –2/P , auxiliary curve have been processed, obtaining P points on cyn # cyn þ 2/P. Those points correspond to a slice orthogonal the otolith surface representative of otolith shape. Fig. 6 shows to the y-axis of width 4/P. If coordinates x and z of those selected the partial result of processing the forward (upper) and backward points are represented on the x–z plane, a figure similar to a (bottom) paths when P ¼ 1600 and L ¼ 20; and the complete section of the shape is drawn. This idea is illustrated for a result for the same otolith is shown in Fig. 7. synthetic shape in Fig. 4. Next, the coordinates mxn and mzn of Fig. 8 shows plots of the coordinates x, y and z for the set of the centre of mass of each slice of points are computed to obtain P ¼ 1600 points that are represented in 3-D in Fig. 7. The x, y and the centre of the section at the point cyn (mxn,cyn,mzn). For every z plots can be interpreted as the fundamental period of a periodic cyn point, the straight line from the centre of mass to the point on function. Note that the graph in the middle, once the number of the auxiliary curve and orthogonal to the y-axis is computed P points is chosen, is, by construction, always the same. Because using the coordinates of those points. That line guides the we know a priori that it contains no information, the y coordi- selection-of-point process. This idea is also represented in the nate can be discarded in the next parameterisation step. Further- illustration in Fig. 4 in which, for a given cyn, a representation of more, notice that the y coordinate is always the same for a set of points of a section appears together with the auxiliary all shapes. To provide an idea of the information reduction line. In the x–z planes, the straight line equations are expressed achieved by the down-sampling process, the shape represented in continuous form by the following: with 150 000 real numbers in the present example could be represented with only 3200 real numbers: those required to z mzn x mxn represent x and z coordinates of the closed curve. Comparing ¼ n ¼ 0; 1; 2; :::; P 1 ð3Þ czn mzn cxn mxn the information provided by the original shape in Fig. 2 with the selection of points shown in Fig. 7, the reduction of points can The last step consists of selecting the point on the otolith be appreciated. However, Fig. 7 retains the most important surface that intersects with that line. In practice, because we morphological information from the otolith shape. have a discretised surface, there is no point on that surface that matches perfectly with the line, so different approaches could be Discrete Fourier expansion of the coordinates: a fast considered. One is to select the discrete point on the surface algorithm by fast Fourier transformation to compute closest to the line; another is to obtain an interpolation given coefficients and to reconstruct closed curves a set of candidates. Both approaches begin with selecting a The subsampling method just presented has some particular reduced set of points by narrowing the area around the line and characteristics. Samples of the otolith surface are taken at reg- analysing the points of the otolith included inside. In the first ular increments along the y-axis, thus the velocity of the closed case, the region is tapered until it contains only a single point. In the second case, the process of narrowing the space around the 1.0
0.8
0.6
0.4
0.2
0 -axis z Ϫ0.2
Ϫ0.4
Ϫ0.6
Ϫ0.8
Ϫ1.0 1 x 0 x–z plane -axis Ϫ1 Ϫ Ϫ Ϫ0.6 Ϫ0.8Ϫ1.0 1.0 0.8 0.6 0.4 0.2 0 0.2 0.4
Fig. 4. Illustration of the subsampling process. For a given yn, the set of y-axis points is shown belonging to a slice, together with the auxiliary line. The selected point comes from the intersection of the line with the section Fig. 5. Representation of the normalised cloud of points together with the boundary. This section was obtained from a synthetic shape. auxiliary curve. Otolith 3-D parameterisation Marine and Freshwater Research 1063 curve sampling is constant. This approach differs from that the otolith sections are not regular and can differ considerably described by Kuhl and Giardina (1982), which imposes constant in size, meaning that parts of the otolith are represented with sampling velocities around the closed curve for all of its x, y different densities of points, with more points in the small sec- and z components, not just for one (y) as described herein. tions, usually located at the otolith extremes. The considerable The advantage of our approach compared with that of Kuhl advantage of our approach is that all the closed curves obtained and Giardina (1982) is that our approach always uses the are described with the same number of points (P) regardless same number of points to obtain a closed curve from a scale- of the shape they describe. The user can choose the parameter normalised shape. In our approach, the samples are taken at P depending on the quality of the digital data or according to a regular angular increments around the auxiliary curve; however, given application. With our subsampling process, a set of P representative points is selected from the original surface. Their spatial 0.4 relationships are described with those points. They can also be 0.2 described by the coefficients of a Fourier expansion derived by a DFT. The fast Fourier transform (FFT) algorithm applied to the 0 -axis P representative points provides a reduced set of parameters z 0.2 with which to reconstruct the closed contour of the objects. This will be illustrated for a single coordinate x and can be extended 0.4 in a similar way to the coordinate z. 0.5 0 Consider the P points (x ,y ,z ) of the 3-D closed curve and x n n n -axis Ϫ0.5 Ϫ0.5 Ϫ1.0 1.0 0.5 0 take without loss of generality only the coordinate xn, where y-axis n ¼ 0, 1,y, P 1. By applying the DFT on xn, the DFT coefficient Xk is computed by: 0.4 XP 1 j2pnk 0.2 Xk ¼ xne P k¼0; 1;:::; P 1 ð4Þ n¼0 0 axis - z Ϫ0.2 and, therefore, the original xn can be written as Xk using the inverse DFT (IDFT) as follows: Ϫ0.4 0.5 XP 1 x 0 1 j 2pnk -axisϪ0.5 Ϫ0.5 Ϫ1.0 xn ¼ Xke P ð5Þ 1.0 0.5 0 n¼0;1;:::;P 1 P k 0 y-axis ¼
Fig. 6. Partial results of processing the forward (top) and backward Of course, with all the DFT coefficients Xk, the reconstruc- (bottom) paths of an otolith surface when the parameters of the auxiliary tion xn where n ¼ 0, 1,y, P 1 is perfect. Now, let us take curve are P ¼ 1600 (number of points retained) and L ¼ 20 (number of advantage of some DFT properties. Because we always loops). The consecutive points are joined by continuous lines. deal with xn samples that are real, from Eqn 4 it is easy to see
Ϫ0.4 Ϫ0.3 Ϫ0.2 Ϫ0.1 0 -axis z Ϫ0.1 Ϫ0.2 Ϫ0.3 Ϫ0.4
0.5 x-axis 0 Ϫ0.5 Ϫ 0 Ϫ0.2 Ϫ0.4 Ϫ0.6 Ϫ0.8 1.0 1.0 0.8 0.6 0.4 0.2 y-axis
Fig. 7. Result of performing subsampling on the cloud of points from the Umbrina cirrosa otolith represented in Fig. 2 when the number of points retained P ¼ 1600 and the number of loops L ¼ 20. The picture was completed by joining the forward and backward paths of Figs 5 and 6. 1064 Marine and Freshwater Research P. Marti-Puig et al.
x coordinate 0.5
0
Ϫ0.5 200 400 600 800 1000 1200 1400 1600 y coordinate 1.0 0.5 0 Ϫ0.5 Ϫ1.0 200 400 600 800 1000 1200 1400 1600 z coordinate 0.4 0.2 0 Ϫ0.2 Ϫ0.4 200 400 600 800 1000 1200 1400 1600 Samples
Fig. 8. Representation of the coordinates of the three-dimensional closed curve given in Fig. 7. The graphs represent a periodical function derived from ‘wrapping’ an otolith. In the representation the function has been subsampled using P ¼ 1600 points. The x coordinate appears at the top, the y coordinate is in the centre and the z coordinate is shown at the bottom. the well-known property of DFT coefficients (Proakis and Eqn 9 uses the result that the addition of a complex number Manolakis 1996): with its complex conjugate gives twice its real part. Therefore, Eqn 5 becomes: XP k ¼ Xk ¼ X k ð6Þ P 1 X2 p X0 1 n 2Re½ Xk 2 where the superindex asterisk (*) in Eqn 6 stands for the xn ¼ þ XPðÞ 1 þ cos nk P P 2 P P complex conjugate operation. This result means that only the k¼1 first half of complex coefficients bring information and the other 2Im½ Xk 2p sin nk n ¼ 0; 1; :::; P 1 ð10Þ half are redundant, because they can be obtained from the first P P half. Now, let us split Eqn 5 into four parts: 0 1 Note than ( 1)n comes from cos 2p nk when k P/2. Then, P P ¼ X2 1 XP 1 1 @ j 2pnk n j 2pnkA if we consider the discrete Fourier expansion of the coordinate xn ¼ X0 þ Xke P þ XPðÞ 1 þ Xke P P 2 xn,(n ¼ 0, 1,..., P 1) provided by the coefficients ak and bk in k¼1 k¼Pþ1 2 n¼0;1;:::;P 1 the form: ð7Þ P X2 2p x ¼ a þ a cos nk Then, analysing the last term of Eqn 7, rearranging the index n 0 k P of its summation from the last element to the first and using Eqn k¼ 1 6, from simple algebra of complex numbers, we have: 2p þb sin nk n ¼ 0; 1; :::; P 1 ð11Þ k P P P XP 1 X2 1 X2 1 2p 2p 2p j P nk j P nPðÞ k j P nk Xke ¼ XP ke ¼ Xk e ð8Þ we can establish from Eqn 10 the relationship between k P 1 k¼1 k¼1 ¼2þ real coefficients ak and bk with the complex DFT coefficients Xk as follows: Then, considering this term together with the second term in Eqn 7 and taking into account that one is the complex conjugate X0 of the other, we can rewrite Eqn 8 as follows: k ¼ 0 ! a0 ¼ P ; b0 ¼ 0 P 2Re½ Xk 2Im½ Xk k ¼ 1; ...; 1 ! ak ¼ ; bk ¼ P P P 2 P P ð12Þ X2 1 X2 1 X2 1 2p 2p 2p XP j P nk j P nk P 2 Xke þ Xk e ¼ 2 Re½ Xk cos nk k ¼ ! aP ¼ ; bP ¼ 0 P 2 2 P 2 k¼1 k¼1 k¼1 p 2 The coefficients X can be computed very quickly from the Im½ Xk sin nk ð9Þ k P P samples of xn using the FFT algorithm, and similarly the xn Otolith 3-D parameterisation Marine and Freshwater Research 1065
0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 -axis
z Ϫ 0.1 -axis Ϫ0.1 z Ϫ0.2 Ϫ0.2 Ϫ0.3 Ϫ0.3 Ϫ0.4 Ϫ0.4 0.5 2 ϫ 10 Coeff. 0.5 ϫ x 2 15 Coeff. x-axis 0 -axis 0 Ϫ 0.5 Ϫ0.2 Ϫ0.4 Ϫ0.6 Ϫ0.8 0.5 0.2 0 Ϫ0.2 Ϫ0.4 Ϫ0.6 Ϫ0.8 Ϫ1.0 1.0 0.8 0.6 0.4 0.2 0 1.0 0.8 0.6 0.4 y-axis y-axis
0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 -axis z Ϫ0.1 -axis Ϫ0.1 z Ϫ0.2 Ϫ0.2 Ϫ0.3 Ϫ0.3 Ϫ0.4 Ϫ0.4 0.5 2 ϫ 30 Coeff. 0.5 2 ϫ 45 Coeff. x 0 x-axis0 -axisϪ 0.5 Ϫ0.2 Ϫ0.4 Ϫ0.6 Ϫ0.8 0.5 0.2 0 Ϫ0.2 Ϫ0.4 Ϫ0.6 Ϫ0.8 Ϫ1.0 1.0 0.8 0.6 0.4 0.2 0 1.0 0.8 0.6 0.4 y-axis y-axis
0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0
-axis Ϫ -axis Ϫ z 0.1 z 0.1 Ϫ0.2 Ϫ0.2 Ϫ0.3 Ϫ0.3 Ϫ0.4 Ϫ0.4 0.5 2 ϫ 60 Coeff. 0.5 2 ϫ 75 Coeff. x 0 x-axis 0 -axis 0.5 Ϫ0.5 Ϫ0.2 Ϫ0.4 Ϫ0.6 Ϫ0.8 Ϫ0.2 Ϫ0.4 Ϫ0.6 Ϫ0.8 Ϫ1.0 1.0 0.8 0.6 0.4 0.2 0 1.0 0.8 0.6 0.4 0.2 0 y-axis y-axis
Fig. 9. Reconstructions of the closed curve of points on the Umbrina cirrosa otolith when the number of points retained P ¼ 800 and the number of loops L ¼ 5. In all figures, the true points are shown in black and the reconstructions have been done with 10 2, 15 2, 30 2, 45 2, 60 2 and 75 2 complex discrete Fourier transform (DFT) coefficients.