Medical Image Analysis 18 (2014) 22–35
Contents lists available at ScienceDirect
Medical Image Analysis
journal homepage: www.elsevier.com/locate/media
Organ-focused mutual information for nonrigid multimodal registration of liver CT and Gd–EOB–DTPA-enhanced MRI ⇑ Laura Fernandez-de-Manuel a,b, , Gert Wollny a,b, Jan Kybic c, Daniel Jimenez-Carretero a,b, Jose M. Tellado d, ⇑ Enrique Ramon e, Manuel Desco f,g, Andres Santos a,b, Javier Pascau f,g, Maria J. Ledesma-Carbayo a,b, a Biomedical Image Technologies Lab., DIE, ETSI Telecomunicación, Universidad Politécnica de Madrid, Madrid, Spain b Centro Investigación Biomédica en Red: Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Spain c Faculty of Electrical Engineering, Czech Technical University in Prague, Czech Republic d Seccion Hepatobiliopancreatica, Servicio de Cirugía General I, Hospital General Universitario Gregorio Marañón, Madrid, Spain e Servicio de Radiodiagnóstico, Hospital General Universitario Gregorio Marañón, Madrid, Spain f Departamento de Bioingeniería e Ingeniería Aeroespacial, Universidad Carlos III de Madrid, Madrid, Spain g Instituto de Investigación Sanitaria Gregorio Marañón, CIBERSAM, Madrid, Spain article info abstract
Article history: Accurate detection of liver lesions is of great importance in hepatic surgery planning. Recent studies have Received 18 October 2012 shown that the detection rate of liver lesions is significantly higher in gadoxetic acid-enhanced magnetic Received in revised form 7 August 2013 resonance imaging (Gd–EOB–DTPA-enhanced MRI) than in contrast-enhanced portal-phase computed Accepted 5 September 2013 tomography (CT); however, the latter remains essential because of its high specificity, good performance Available online 13 September 2013 in estimating liver volumes and better vessel visibility. To characterize liver lesions using both the above image modalities, we propose a multimodal nonrigid registration framework using organ-focused mutual Keywords: information (OF-MI). This proposal tries to improve mutual information (MI) based registration by adding Image registration spatial information, benefiting from the availability of expert liver segmentation in clinical protocols. The Mutual information Liver surgery incorporation of an additional information channel containing liver segmentation information was stud- Magnetic resonance imaging ied. A dataset of real clinical images and simulated images was used in the validation process. A Gd–EOB– Computed tomography DTPA-enhanced MRI simulation framework is presented. To evaluate results, warping index errors were calculated for the simulated data, and landmark-based and surface-based errors were calculated for the real data. An improvement of the registration accuracy for OF-MI as compared with MI was found for both simulated and real datasets. Statistical significance of the difference was tested and confirmed in the simulated dataset (p < 0.01). 2013 Elsevier B.V. All rights reserved.
1. Introduction mapping of hepatic vasculature, spatial localization of tumors and their relation with other tumors or vascular structures, and The advances made in recent decades in the understanding of the estimation of remnant liver volume in order to determine the liver anatomy and physiology (Couinaud, 1999), together with suitability of a patient for surgery and to decide the procedure. the improvement of medical imaging techniques (Handels and Ehr- Accurate detection of individual liver lesions is of great importance hardt, 2009; Radtke et al., 2007) and the progressive safety of sur- because their number, location, and relationships determine both gical instrumentation, allow surgeons to design complex liver resectability (the probability of performing a resection safely) resections more accurately and effectively without jeopardizing and radicality (the probability of a potential cure by achieving an patient safety. Further, preoperative planning has become an R0 resection) (Solbiati et al., 1999). essential task before undertaking liver surgery. It requires the During preoperative planning of a liver resection, different imag- ing modalities play different roles: for example, for accurate detec- tion of cancer (staging) and for practical description of inner liver ⇑ Corresponding authors. Address: Biomedical Image Technologies Lab., DIE, ETSI anatomy (mapping). Recently, clinical protocols have included con- Telecomunicación, Universidad Politécnica de Madrid, 28040 Madrid, Spain. trast-enhanced computed tomography (CT) and a liver-specific con- E-mail addresses: [email protected] (L. Fernandez-de-Manuel), gw.foss- trast agent-enhanced magnetic resonance imaging (MRI). Several [email protected] (G. Wollny), [email protected] (J. Kybic), daniel.jimenezc@- die.upm.es (D. Jimenez-Carretero), [email protected] (J.M. studies have analyzed the advantages and constraints of both meth- Tellado), [email protected] (E. Ramon), [email protected] (M. ods (Oliva and Saini, 2004; Oudkerk et al., 2002; Weinmann et al., Desco), [email protected] (A. Santos), [email protected] (J. Pascau), mledes- 2003). Contrast-enhanced portal-phase CT imaging offers high [email protected] (M.J. Ledesma-Carbayo).
1361-8415/$ - see front matter 2013 Elsevier B.V. All rights reserved. http://dx.doi.org/10.1016/j.media.2013.09.002 L. Fernandez-de-Manuel et al. / Medical Image Analysis 18 (2014) 22–35 23 sensitivity and specificity for detecting hepatic metastases and is in noma), secondary tumors (for example, liver metastases most cases more convenient than MRI for evaluating the extrahe- secondary to colorectal cancer) and other liver diseases, an accu- patic abdomen (Oliva and Saini, 2004) and for estimating remnant rate multimodal nonrigid image registration algorithm is clearly liver volumes because of its higher spatial resolution, better vessel required. visibility, and wide availability and acquisition speed. However, Some studies have presented methods for liver monomodal im- the lesion detection rate has been found to be significantly higher age registration (Carrillo et al., 2000; Lange et al., 2005b). Other with Gd–EOB–DTPA-enhanced MRI as compared with CT, especially proposed techniques have focused on compensating multimodal for small lesions (Donati et al., 2010; Hammerstingl et al., 2008) image differences in the location and motion of the liver in relation (Fig. 1). Gd–EOB–DTPA (gadoxetic acid, Primovist in Europe, Eo- to other organs by using rigid approaches (Van Dalen et al., 2004) vist in the US, by Bayer HealthCare Pharmaceuticals) is an organ- or nonrigid methods based on finite elements, B-splines or demons specific contrast medium for hepatic MRI, in use since 2005. In de- (Archip et al., 2007). Different similitude criteria have been also ap- layed T1-weighted MRI, it produces strong signal enhancement in plied, such as voxel similarity or surface based criteria (Lee et al., normal liver parenchyma and absence of signal for focal liver lesions 2005). However, to the best of our knowledge, the performance with absence of hepatocellular activity. Consequently, detection of in terms of correspondences between internal liver structures, le- liver metastases and other secondary malignant liver tumors is im- sions, and vascular landmarks such as vessel bifurcations has not proved. However, vessels are not opacified. been evaluated. Additionally, CT/Gd–EOB–DTPA-enhanced MRI Considering the advantages and disadvantages of the different registration methods have not been proposed before. imaging modalities and contrast media, several researchers have Voxel intensity measures have been shown to be robust mea- pointed out the importance of combining MRI and CT for the detec- sures of image similarity. There are several possible image metrics tion and localization of hepatic lesions and their relation with ves- that are used in voxel similarity-based image registration (Crum sels for therapy planning (Bluemke et al., 2000; Kong et al., 2008; et al., 2004; Hill et al., 2001; Maintz and Viergever, 1998; Rueckert Lange et al., 2005a). In this work, we propose a nonrigid registra- and Schnabel, 2011; Zitova and Flusser, 2003): correlation coeffi- tion framework for aligning contrast-enhanced portal-phase CT cient, sum of squared differences, or mutual information (MI). MI and delayed T1-weighted Gd–EOB–DTPA-enhanced MRI into a (Maes et al., 1997; Mattes et al., 2003, 2001; Pluim et al., 2003; common coordinate system. As far as we know, this problem has Wells et al., 1996) is one of the more successful medical image sim- never been tackled before. To use all available data and improve ilarity measures. However, extending the maximization of MI to the registration robustness and accuracy, we propose the use of nonrigid image registration and applying it to extensive areas of an organ-focused mutual information (OF-MI) registration body images is still an active field of research. Moreover, the most criterion. important drawback of MI is that, due to the absence of spatial information, intensity relationships in one region can occasionally 1.1. State of the art mislead the algorithm in another region where the intensity rela- tionships are completely different (e.g., problems with spatially To date, commercial systems for planning hepatic surgery varying intensity inhomogeneity in MRI (Loeckx et al., 2010) or li- mostly align images rigidly. However, registering soft tissues with ver vessel misalignments in contrast-enhanced CT and delayed T1- rigid registration may result in errors as high as 19–20 mm (Archip weighted Gd–EOB–DTPA-enhanced MRI (Fig. 2)). et al., 2007; Lee et al., 2005), due to deformations that may be Some studies have focused on improving registration accuracy caused by liver movements because of respiration, variations of po- by considering the use of additional image gradient information sition, and corporal mass changes over time. To improve detection (Pluim et al., 2000a,b), neighbor pixel information (Heinrich and characterization in terms of volume and relation with vascula- et al., 2012b; Kybic and Vnucˇko, 2012; Rueckert et al., 2000), tex- ture of primary liver cancers (for example hepatocellular carci- tural information (Heinrich et al., 2012a), or different approaches to weighted MI (Park et al., 2010; Rodriguez-Carranza and Loew, 1999; Van Dalen et al., 2004). One approach to weighted MI is the regularization of MI with the use of weights based on overlaps (Rodriguez-Carranza and Loew, 1999) without including spatial information. Other weighted MI approaches increase histogram contributions of certain pixels (Park et al., 2010) or restrict the reg- istration to certain regions (Van Dalen et al., 2004). Nevertheless, the main problems with the application of the last method are the lack of information on the borders of the regions and neighbor- ing structures, and having too few samples to obtain a good entro- py estimation. These problems may be less significant in rigid scenarios. However, they are more relevant when nonrigid trans- formations are required, hence, the weighted MI performance strongly depends on the concrete registration problem. Recent approaches add spatial context to mutual information, either by studying different spatial encoding schemes (Zhuang et al., 2011) or by searching for the correspondence of a priori learned set of image patches (Yi and Soatto, 2011). In Hermosillo et al. (2002) the formulation of a locally computed similarity mea- sure is presented and in Rogelj et al. (2003) a variant to obtain pointwise similarity metric is described. First attempts to incorporate an additional information channel Fig. 1. Contrast-enhanced portal-phase CT image and delayed T1-weighted into the histogram definition of the MI were tackled by Studholme gadoxetic acid MRI from a patient after right hepatectomy. Arrows show metastases et al. (1996) for the rigid registration of MRI and positron emission within the different modalities. Each row represents different slice positions. The figure shows how metastases clearly identified in MRI are hardly visible in CT. tomography images of the pelvis. In Studholme et al. (2006) a re- Better vessel visibility is observed in the CT. lated method named regional mutual information (RMI) was ap- 24 L. Fernandez-de-Manuel et al. / Medical Image Analysis 18 (2014) 22–35
entropy of a discrete distribution is invariant to rotations and translations and making the simplifying assumption that high- dimensional distributions are approximately normally distributed. In Studholme et al. (1996) preliminary rigid registration results demonstrated the possibility of extending the histogram with a small number of unconnected regions of similar intensity ranges (e.g. air or fat tissue). As the reference image R and the regions L were inherently registered, they proposed an extension I of the mutual information for more than two variables defined by: IðT; R; LÞ¼HðR; LÞþHðTÞ HðT; R; LÞð3Þ Their preliminary results motivated us to use the same extension of the mutual information for more than two variables. However, they proposed the use of regions calculated by using only intensity ranges. Unconnected regions containing the same range of intensi- ties were considered as different, which does not allow separating intensity relationships based on anatomical reasons nor having re- gions with more than one range of intensities.
1.2. Our contribution
As already pointed out above, delayed T1-weighted Gd–EOB– DTPA-enhanced MRI causes strong signal enhancement for normal liver parenchyma. For this reason, the relationship between inten- sities in CT and MRI images is very different inside and outside the liver. Hence, using the classical formulation of MI, intensity rela- tionships in a region can occasionally mislead the algorithm in an- other region where the relationships are completely different; especially in nonrigid registrations. Liver volume estimation from preoperative CT is a routine man- datory process to determine the suitability of a patient for surgery and to make the final clinical decision before extensive hepatecto- mies (Fernandez-de-Manuel et al., 2011; Heimann et al., 2009). To estimate remnant liver volumes, both manual and automatic seg- Fig. 2. Cost functions for a synthetic 2D model as a function of horizontal mentation tools are applied in the daily clinical routine. Conse- transformation. A local nonrigid deformation is applied with maximum amplitude quently, liver segmentation from CT images are available in most dx from 1.5 pixels to 1.5 pixels in the middle of the vessel region. The EMI cost hospitals for patients considered for liver surgery. For that reason, function shows multiple local minima and a global minimum shifted away from 0 incorporating the liver segmentation information into the registra- where the correct solution should be. The EOF-MI cost function shows a clean global minimum in the correct location. tion process is very feasible. We discarded the approaches presented in Loeckx et al. (2010), Russakoff et al. (2004) and Studholme et al. (2006) because those plied to analyze local tissue contrast changes in brain MRI nonrigid techniques cannot take advantage of the available liver registration. The RMI was defined as: segmentations. In this work, we propose the use of an organ-focused mutual RMIðT; R; XÞ¼HðTÞþHðRÞþHðXÞ HðT; R; XÞð1Þ information (OF-MI) criterion. We extend the joint histogram with with X expressing the spatial position of cubic overlapping subre- an additional information channel using an extension of the mu- gions in the reference image R and the test image T.InLoeckx tual information for more than two variables similar to the one et al. (2010) a conditional mutual information (cMI) was defined proposed in Studholme et al. (1996) but with a different probabil- and calculated between two images T, R, given a certain spatial dis- ity distribution estimation. Studholme et al. (1996) used regions tribution X. Besides the intensity dimensions, a third spatial channel segmented based on intensity ranges, however, they suggested to was incorporated into the histogram definition, expressing the spa- exploit a higher level of anatomical knowledge. Therefore, we take tial location of every joint intensity pair: advantage of an anatomical segmentation resulting in regions with varying intensities. We consider that using anatomical regions cMIðT; RjXÞ¼HðTjXÞþHðRjXÞ HðT; RjXÞð2Þ actually allows taking maximum advantage of the definition pro- In Russakoff et al. (2004), a regional MI was described, introduc- posed in Studholme et al. (1996). Additionally, we have extended ing neighborhood regions of pixels into a multidimensional histo- its implementation to nonrigid multimodal registration. Conse- gram. In this work, each pixel co-occurrence was represented by quently, our work’s main contributions are related to the experi- more than one entry in the joint histogram depending on its neigh- mental novelty and the obtained results in a clinical application bor pixel co-occurrences. However, one should notice that as de- that benefits from the proposed approach. scribed in Russakoff et al. (2004), the main problem in increasing Therefore, the main contributions of this work are (1) an organ- the dimensionality of joint histograms is the need of a higher num- focused mutual information as registration criterion that takes ber of samples to obtain a reasonable estimate of entropy distribu- advantage of available clinical segmentation, (2) the mathematical tion. Therefore, these methods require a large number of image formulation for its implementation within a B-spline based regis- voxels hence increasing the computational complexity enor- tration framework using the explicit derivatives of the metric, (3) mously. In Russakoff et al. (2004) a method to make the problem a method to simulate Gd–EOB–DTPA-enhanced MRI from CT for more tractable is proposed, taking advantage of the fact that the validation purposes, (4) a thorough validation of the method with L. Fernandez-de-Manuel et al. / Medical Image Analysis 18 (2014) 22–35 25
synthetically generated data as well as its application to relevant the image, and bn represents an N-dimensional tensor product of clinical liver datasets (CT and Gd–EOB–DTPA-enhanced MRI), and centered B-splines of degree n. c a comparison of the registration performance using OF-MI com- The warped test image W is defined as fwðxÞ¼ft ðgðxÞÞ. We de- pared with MI as registration criterion. fine the solution to our registration problem as the result of the
The proposed criterion takes into account an organ (liver) seg- minimization g = arg ming2G E(g), where G is the space of all admis- mentation based on the semi-automatic method described in Fer- sible deformation functions g and E is the criterion. For the pro- nandez-de-Manuel et al. (2009) and Jimenez-Carretero et al. posed application, we consider the criterion: (2011). EðgÞ¼EdðW; RÞþcErðgÞð6Þ The algorithm has been validated and compared with the stan- dard MI on a simulated 3D dataset with 63 image pairs, using a de- where Ed is an MI-based image dissimilarity criterion and Er is a reg- layed T1-weighted Gd–EOB–DTPA-enhanced MRI simulation ularization term with weight c used to prevent discontinuities and framework. Additionally a dataset of seven real subjects referred to guarantee overall smoothness. For this particular problem, we to surgery with one contrast-enhanced portal-phase CT and one use a discrete approximation to the norm of the Laplacian of the delayed T1-weighted Gd–EOB–DTPA-enhanced MRI each have continuous deformation as Er (Kybic, 2001). been used. To minimize the criterion E with respect to a finite number of parameters c we use a gradient descent optimizer with quadratic step size estimation, as recommended in Kybic and Unser (2003). 2. Methods The optimization uses a multiresolution approach for the image model. The multiresolution methodology used creates a pyramid In Sections 2.1 and 2.2, we will introduce the general frame- of subsampled images optimal in the L2 sense, taking advantage work and MI, describing briefly the equations presented in previ- of the spline representation (Unser et al., 1993). The problem is ous work (Kybic and Unser, 2003; Thévenaz and Unser, 2000). solved by starting at the coarser level of the pyramid (the most We will then explain our contribution in Section 2.3. subsampled image) and proceeding to the finest level.
2.1. Problem definition and registration framework 2.2. Mutual information
The intensity-based nonrigid registration algorithm used ex- The joint intensity probability distribution is estimated by tends the previous B-spline method of Kybic and Unser (2003). means of Parzen windows because of their good properties, such The algorithm determines a set of B-spline coefficients that de- as computational efficiency (Unser, 1999). scribe a nonrigid transformation that maximizes an image similar- Following Thévenaz and Unser (2000), the contribution to the ity measure. The transformation model is defined as a linear joint histogram of a single pair of pixels with intensities (fw, fr) is dis- combination of B-spline basis functions located on a uniform grid. tributed over several discrete bins (t,r) with t and r belonging to dis- B-spline functions have been widely used to represent deforma- crete sets of intensities associated with the test and reference tions (Kybic and Unser, 2003; Ledesma-Carbayo et al., 2005; Oguro images, with ranges from 0 to nbinsT 1 and nbinsR 1, respectively. et al., 2009; Rueckert et al., 1999; Schnabel et al., 2001), motivated Intensities (fw, fr) can take values in a continuum in the ranges (fw by their compact support, computational simplicity, good approx- min, fw max) and (fr min, fr max), respectively. Using B-spline functions imation properties, and implicit smoothness. We also use B-spline of degree m1 and m2, the discrete joint probability of co-occurring functions for representing continuous images derived from a set of intensities in the overlap of the two images fw and fr is expressed as: samples (Kybic and Unser, 2003; Thévenaz and Unser, 2000). 1 X Moreover, B-spline basis functions are used as Parzen windows p t; r; c b t i; c b r i 7 ð Þ¼ m1 ð sð ÞÞ m2 ð qð ÞÞ ð Þ jIcj N (Thévenaz and Unser, 2000) in the similarity criteria, as described i2Ic Z later. The input images are given as two N-dimensional discrete sig- and the discrete marginal probability distributions for the warped nals: the test image T and the reference image R with intensities test and the reference images, respectively, are: N X ft(i) and fr(i), respectively, where i 2 I Z , and I is an N-dimen- 1 p ðt; cÞ¼ b ðt sði; cÞÞ sional discrete interval representing the set of all voxel coordinates T m1 jIcj N i2Ic Z in the image. For convenience in our formulation we use a contin- X ð8Þ c 1 uous representation ft ðxÞ of the discrete test image ft(i) as follows: p r b r i Rð Þ¼ m2 ð qð ÞÞ X jIcj N c i2Ic Z ft ðxÞ¼ aibmðx iÞ N i2Ia Z ð4Þ where Ic is a discrete set of samples Ic I. s and q are the test and c N reference images after scaling the continuous interval (0, nbinsT 1) ftðiÞ¼ft ðiÞ 8i 2 I Z and (0, nbinsR 1), respectively: where bm represents an N-dimensional tensor product of centered nbinsT 1 B-splines of degree m (Kybic, 2001), ai are the B-spline coefficients sði; cÞ¼ðÞ fwði; cÞ fw min fw max fw min that represent the original test image given by its samples ft(i), and ð9Þ nbinsR 1 Ia is the set of nodes used to represent the image. qðiÞ¼ðÞ frðiÞ fr min Let g(x) be a deformation function that finds the spatial corre- fr max fr min spondence between coordinates in the test and reference images. The MI-based dissimilarity criterion EMI can be defined from the The deformation is represented using splines: above probabilities as a function of the deformation parameters c: X XX gðxÞ¼x þ cjbnðx=h jÞð5Þ pðt; r; cÞ Ed ¼ EMIðW; RÞ¼ pðt; r; cÞ log ð10Þ j2I ZN p t; c p r b 8t 8r T ð Þ Rð Þ
N described by a finite number of parameters c ¼fcjg; j 2 Ib Z ; Notice that in this work we denote by EMI the negative version where Ib is an N-dimensional discrete interval representing the set of the standard mutual information, so we search for the EMI of parameter indexes, h is the knot spacing on a regular grid over minimum. 26 L. Fernandez-de-Manuel et al. / Medical Image Analysis 18 (2014) 22–35
We can also express MI-based dissimilarity criterion EMI be- gery planning procedure. Uncertainty of the segmentations could tween a warped image W and a reference image R in terms of be taken into account by smoothing the mask edges using a Gauss- the marginal and joint entropies: ian filtering; however, experiments revealed that this does not bring any benefit for this application. Therefore, in this work, the Ed ¼ EMIðW; RÞ¼HðW; RÞ HðWÞ HðRÞð11Þ voxel probability of belonging to a region will be either 0 or 1; as For the optimization algorithm, partial derivatives of the EMI each voxel contributes only to one region. with respect to cj are needed: Considering a binary image f (x) with dimensions identical to X X c fr(x) that represents the clinical segmented liver in the reference @EMI @EMI @ft ðxÞ @gkðiÞ ¼ ð12Þ image, we define PXl ðiÞ as follows: @cj;k @fwðiÞ @xk x¼gðiÞ @cj;k i2Ia fXðxÞjx¼i if l ¼ 1 where k is the dimension of the N-dimensional cj. For further details PXl ðiÞ¼ ð19Þ on the derivatives calculation, we refer the reader to Thévenaz and 1 fXðxÞjx¼i if l ¼ 0 Unser (2000). In this work the initial liver binary images fX(x) have been cre- ated semi-automatically by a liver segmentation application based 2.3. Organ-focused mutual information on active contours previously described in Fernandez-de-Manuel et al. (2009) and Jimenez-Carretero et al. (2011). We present here an OF-MI criterion that allows including prob- Notice that with our proposed OF-MI we are not neglecting any abilities of voxels belonging to the object or the background. area of the image, as all the regions are represented in the 3D joint We introduce an additional information channel consisting of intensity probability distributions and optimized together. an image L containing for every voxel its probability PXl ðiÞ of belonging to the background X and the object (liver) X , P i sat- P 0 1 Xl ð Þ 2.3.2. MI versus OF-MI synthetic examples isfying 8lPXl ðiÞ¼18i; l ¼ 0; 1. Based on (3) (Studholme et al., 1996) we define OF-MI-based To illustrate theoretically the behavior of the proposed ap- proach compared with MI and its shortcomings related to its dissimilarity criterion EOF-MI between a warped image W and a pair (R,L) consisting of a reference image R and the probability image L assumption of equal statistical relationships over the whole do- in terms of the marginal and joint entropies: main of the images, two 2D basic synthetic images were created representing a contrast-enhanced portal-phase CT model and a de- Ed ¼ EOF MIðW; R; LÞ¼HðW; R; LÞ HðWÞ HðR; LÞð13Þ layed T1-weighted Gd–EOB–DTPA-enhanced MRI model. Both The joint probability histogram is extended with a third dimen- models contained the liver, the kidneys, the spleen, and an intrahe- sion of size 2, with l the coordinate representing inside (l = 1) and patic lesion and vessel. They represented a realistic distribution of outside (l = 0) the liver region. Using B-spline functions of degree intensities and were initially registered completely. We used the CT liver segmentation (liver region mask in Fig. 2). We then applied m1 and m2, we define a 3D discrete joint probability distribution: 1 X a mild horizontal nonrigid deformation with maximum amplitude pðt; r; l; cÞ¼ PX ðiÞ b ðt sði; cÞÞ b ðr qðiÞÞ ð14Þ l m1 m2 dx around the center of the hepatic vessel to the Gd–EOB–DTPA- jIcj N i2Ic Z enhanced MRI model and evaluated the dependency of the crite- The marginal organ-focused joint intensity probability distribu- rion on dx, with dx ranging from –1.5 pixels to 1.5 pixels (Fig. 2). tion for the reference image is: The EMI exhibits multiple local minima and a global minimum far from the correct location. This happens because the intensity 1 X p r; l P i b r i 15 relationships between CT and MRI pixels in extrahepatic organs RLð Þ¼ Xl ð Þ m2 ð qð ÞÞ ð Þ jIcj N i2Ic Z (spleen) mislead the algorithm into considering that the hepatic vessel in MRI should be aligned with the liver tissue in CT. On The marginal intensity probability distribution for the test image is the other hand, the EOF-MI cost function shows a clean global min- given in (8). imum at the correct location, as the intensity relationships inside The dissimilarity criterion EOF-MI is defined from the above and outside the liver are considered separately. probabilities as follows: XXX pðt; r; l; cÞ E ðW; R; LÞ¼ pðt; r; l; cÞ log ð16Þ OF MI p ðt; cÞ p ðr; lÞ 3. Validation methodology 8t 8r 8l T RL
The partial derivatives of the EOF-MI with respect to cj are: In this section, we will first describe the simulated and real im- X c @ðEOF MIÞ @ðEOF MIÞ @ft ðxÞ @gkðiÞ age datasets. Then, we will illustrate the definition of the quantita- ¼ ð17Þ @cj;k @fwðiÞ @xk @cj;k tive measures used to validate the experiments. After that, the i2Ia x¼gðiÞ registration parameters for both, the MI and the OF-MI criteria (For further detail on the derivatives calculation, we refer the reader are given, followed by the validation results. to the Appendix A). From (13) and (16) the expressions for the marginal and joint 3.1. Data entropies remainX asX follows:X HðW; R; LÞ¼ pðt; r; l; cÞ log pðt; r; l; cÞ 3.1.1. Simulated images dataset 8t 8r 8l X A dataset of 63 image pairs, each composed of one simulated H W p t; c log p t; c ð Þ¼ T ð Þ T ð Þ ð18Þ hepatic delayed T1-weighted Gd–EOB–DTPA-enhanced MRI and 8t XX one contrast-enhanced portal-phase CT was generated; the images HðR; LÞ¼ pRLðr; lÞ log pRLðr; lÞ had different noise levels and deformation values. Gd–EOB–DTPA- r 8 8l enhanced MRI images were simulated from seven real CT images of the abdominal body region at the liver level. CT images were taken
2.3.1. Estimation of the region probabilities PXl ðiÞ as provided. The processing steps to simulate MRI images con-
The probability image L containing PXl ðiÞ; l ¼ 0; 1 is computed sisted of a nonlinear intensity transformation, a low-pass Gaussian from the hepatic masks (liver segmentation) obtained for the sur- filter, morphological edge detection, and Gaussian noise addition. L. Fernandez-de-Manuel et al. / Medical Image Analysis 18 (2014) 22–35 27
The gray values of the simulated MRI images were calculated by The MRI signal is corrupted by an additive noise process (Kwan a nonlinear intensity transformation based on intensity distribu- et al., 1999). As noise distributions in MRI images are nearly white tions of contrast-enhanced portal-phase CT and delayed T1- Gaussian for signal-to-noise ratios (SNR) greater than 2 (Gudbjarts- weighted Gd–EOB–DTPA-enhanced MRI (Fig. 3a and b). By direct son and Patz, 1995), we added independent realizations of white observation of these distributions, we assigned the interval of Gaussian noise to our simulated dataset. We measured the amount intensities approximating those in MRI to the interval of intensities of added noise as a signal-to-noise ratio according to Bushberg in the CT for each relevant organ or structure, differentiating be- et al. (2002): tween liver structures and background. Therefore, different trans- formations were applied to different CT regions, assigning tissue- SNR ¼ A=rn ð20Þ dependent signal intensity to each region (liver and liver back- where A is the mean image pixel intensity, and rn is the standard ground) (Fig. 3c and d). In order to apply these intensity transfor- deviation of the Gaussian noise. Different levels of Gaussian noise mations, CT regions were calculated by segmenting the liver in were added to the simulated data ranging from SNR 3 to SNR 7. the CT. Segmentations were made by an independent person blind Additionally, known transformations were applied to the origi- to those used during the registration process in order to guarantee nal CT images to generate reference images for the registration the independence of the simulation step with respect to the experiments. The transformations were modeled using a closed- registrations. form function that defines the spatial dependence of the deforma- To simulate the partial volume effect in MRI (Tohka et al., 2004), tion, mimicking nonrigid organ movements due to respiration and a 3D low-pass Gaussian filter with a standard deviation for the volume changes along time caused by differences in patient Gaussian kernel of [1,1,3] was applied after the intensity weight, organ disposition, or liver volume growth due to illness transformation. evolution or portal vein embolization. In our transformation model Gd–EOB–DTPA-enhanced MRI images commonly show better we assume that the deformation is 0 in the center of the body and edge enhancement than CT images. To simulate this particular fea- maximum at a liver distance, as the soft-tissue motion of the liver ture, borders were first calculated by morphological edge detection is highly influenced by the motion of both the diaphragm and the and added by summation to the existing image. ribcage (Villard et al., 2011). According to some sources, local liver
Fig. 3. Intensity distribution analysis in contrast-enhanced portal-phase CT (a) and delayed T1-weighted Gd–EOB–DTPA-enhanced MRI (b) for abdominal organs and important regions inside the liver (parenchyma, metastasis and vessels). Nonlinear intensity transformations applied in liver region (c) and liver background (d) to generate MRI simulated images from CT images. 28 L. Fernandez-de-Manuel et al. / Medical Image Analysis 18 (2014) 22–35 deformation due to respiration can range from 10 to 26 mm in with a 16-MDCT scanner (Brilliance 16; Philips Medical Systems, amplitude between the extremes of the respiratory cycle (Blackall Eindhoven, The Netherlands) in all cases. The scanning parameters et al., 2005; Clifford et al., 2002; Rohlfing et al., 2001). Moreover, were 120 kVp, 250–300 mA s, 2-mm slice thickness with an over- we approximate the deformation as a continuous movement. lap of 1 mm (pitch, 0.9), and a single-breath-hold helical acquisi- Therefore, we represent the dependency of the deformation on tion. The images were obtained in the craniocaudal direction. the distance to the center of the image by using a sinusoidal func- Hepatic portal-phase scanning began 70 s after injection of tion that allows us to have a symmetric deformation all around the 120 ml of a nonionic iodinated contrast agent (Ioversol, Optiray contour of the body with maximum value in the center of the liver. Ultraject 300; Covidien). For delayed T1-weighted Gd–EOB–
The simulated deformation is defined by t(i)={tk(ik)}k=1, 2, 3, where DTPA-enhanced MRI, 20-min delayed hepatobiliary phase images i 2 I Z3 and I is the 3-dimensional discrete interval representing were obtained with a T1-weighted 3D turbo-field-echo sequence the set of all voxel coordinates in the image. t(i) is applied voxel by (T1 high-resolution isotropic volume examination, THRIVE; Philips voxel as defined by the equation: Medical Systems, Eindhoven, The Netherlands) (3.4/1.8; flip angle 10 ; matrix size, 336 206; bandwidth, 995.7 Hz/pixel) with a 2- x i p t ði Þ¼i m sin c;k k ð21Þ mm section thickness, no intersection gap, and a field of view of k k k k x x 2 j c;k w;kj 32–38 cm. Details regarding clinical information are summarized where xc are the voxel coordinates with minimum deformation, xw in Table 1. are the voxel coordinates with maximum deformation and m is the MRI images were manually aligned onto the corresponding CT maximum deformation. xc represents the center of the image and by a point-based rigid registration. xw represents the points at 1/3 of the extreme of the image that The images were then cropped in axes X, Y, and Z to restrict the comprise the liver in all the models. Therefore, these parameters de- subsequent image-processing steps to the complete body region at pend on the image size sim ={sim,k}k=1, 2, 3 as follows: liver level. The images were then resampled to pixel size [1,1,1] mm, guaranteeing isotropy. CT was used as the reference image, xc ¼ p sim c ð22Þ and MRI as the test image. xw ¼ pw sim 3.2. Error measures where pc = 0.5 and pw = 1/3. Considering the literature references about local liver deformation (Blackall et al., 2005; Clifford et al., 3.2.1. Measures on the simulated dataset 2002; Rohlfing et al., 2001), we decided to apply to each of our 63 Registration of simulated CT and MRI datasets was evaluated in models a maximum deformation value that varies randomly be- terms of geometric error by comparing the resulting transforma- tween 4 mm and 28 mm at liver level. m represents the magnitude k tion with the applied analytical one on a voxel-by-voxel basis of the maximum deformation for each dimension and takes random using the warping index (WI) (Thévenaz et al., 1998). The WI cal- values in the range [4,28] (mm). culation was restricted to the liver region: An example of a synthetic delayed T1-weighted Gd–EOB–DTPA- enhanced MRI can be seen and compared with a real MRI image in 1 X WI ¼ kgðiÞ g ðiÞk ð23Þ Fig. 4 as well as an example of a simulated deformation. jRj i2R
3.1.2. Real images dataset where g is the true deformation, R represents the set of all voxel A dataset consisting of seven clinical subjects with a wide vari- coordinates inside the liver, and || || the Euclidean distance. ety of pathological scenarios was used. Each case has one contrast-enhanced portal-phase CT and one delayed T1-weighted 3.2.2. Measures on the real dataset Gd–EOB–DTPA-enhanced MRI from a retrospective clinical dataset. To establish an independent validation procedure, radiologists The contrast-enhanced portal-phase helical CTs were performed annotated all real images manually, defining a set of 10 intrinsic anatomical hepatic landmarks for each pair of images. Registration results were evaluated in terms of the mean distance error be- tween corresponding anatomical landmarks before and after regis- tration for each subject (landmark-based mean errors). Landmarks were located in vessel intersections, hepatic fissures and ligaments, and small lesions. Because of the high dependency of error results on the accuracy of the landmark selections, additional error criteria were also con- sidered. For this purpose, the liver was manually segmented in the CT and MRI images. Segmentations were made by an independent expert blind to those used during the registration process in order to guarantee the independence of the registrations with respect to the validation. Comparing liver segmentation before and after reg- istration allows calculation of surface-based mean errors (ME): 1 X ME ¼ d ð24Þ jSj i i2S where S is an N-dimensional discrete interval representing the set of all voxel coordinates in both liver segmentation surfaces; it is ob- tained by considering voxels inside the segmented liver with at least one of their 18 nearest neighbors not belonging to the liver.
di represents the Euclidean distance of a voxel i to the closest one Fig. 4. (a) Original CT, (b) CT with simulated deformation, (c) synthetic gadoxetic in the other segmentation. ME (mm) is zero for two perfectly regis- acid MRI, (d) real gadoxetic acid MRI. tered surfaces. L. Fernandez-de-Manuel et al. / Medical Image Analysis 18 (2014) 22–35 29
Table 1 Real dataset clinical description ( Roman numerals represent Couinaud hepatic segments (Couinaud, 1999)).
Subject Clinical information Image dates CT-visible lesions MRI-visible lesions 1 Hepatectomy MRI 26 days post-CT 1 in hepatic duct 1 in hepatic duct Metastases 1 in IVb Chemotherapy 2 Metastases CT 45 days post-MRI 0 1 in II/III Chemotherapy 1 in IVa/VIII 1inVI 1inI 3 Metastases MRI 21 days post-CT 1 in IV 1 in VIII 1 in VIII 4 Hepatectomy MRI 10 months post-CT 0 0 Metastases 5 Metastases MRI 12 days post-CT 3 in VIII 1 in VIII Cholecystectomy 1 in IV 1 in VII/VIII 1 in II 1 in II 6 Metastases MRI 4 months post-CT 0 0 Sectorectomy 7 Metastases MRI 1 month post-CT 1 in VII 1 in IV 1inII
3.3. Parameter optimizations
To investigate the best parameter combination, we tested the performance of the algorithm with one real training subject. Com- mon parameters in the intensity-based nonrigid registration algo- rithm are: