UNIVERSITY OF CALGARY
Efficient S-Transform Techniques for Magnetic Resonance Imaging
by
Sylvia Drabycz
A THESIS
SUBMITTED TO THE FACULTY OF GRADUATE STUDIES
IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE
DEGREE OF DOCTOR OF PHILOSOPHY
GRADUATE PROGRAM IN BIOMEDICAL ENGINEERING
CALGARY, ALBERTA
MARCH, 2009
© Sylvia Drabycz 2009
ISBN: 978-0-494-51180-0
Abstract
The S-transform (ST) is a promising technique for magnetic resonance (MR) image texture analysis that has previously been successfully applied to the study of multiple sclerosis (MS) and brain cancer. Two of the major limitations of the ST are the computation time and the storage requirements. This thesis introduces two new techniques that reduce the complexity and redundancy of the ST, addressing these limitations.
A version of the ST that uses circularly symmetric windows, known as the circu- lar S-transform (cST), is proposed and its properties are studied. It is shown that the cST produces results similar to the polar S-transform (pST) but requires less computation time.
Space-frequency transforms such as the ST can also be utilized for image filtering and denoising. A novel application of the ST is proposed to study lesion dynamics in MS. It is found that a texture measure derived from the cST can discriminate between the core of a hyperintense active lesion and its less hyperintense periphery.
A two-dimensional (2D) frequency-domain implementation of the discrete or- thonormal S-transform (DOST) is also presented. The DOST addresses both the computational and storage requirements of the ST by removing redundancy from the ST and providing a rapid calculation of the space-frequency domain. Novel texture features are derived from the DOST and it is shown that these features are more powerful when classifying unknown texture patterns than a leading method using the discrete wavelet transform (DWT).
Finally, the cST, pST and the DOST are applied to a study of glioblastoma iii (GBM) texture. It is shown that the DOST is the most accurate texture analysis approach when blindly classifying tumors based on their methylation status. Inter- estingly, differences were only found when analyzing small region of interests (ROIs) within the tumors, not when analyzing the entire image and extracting spectra from the entire tumor.
iv Acknowledgements
I would like to begin by thanking my parents for their never-ending support and encouragement. It is because of the sacrifices they made in their own lives that I have had the opportunity to pursue higher education. I learned from them that hard work and persistence are the keys to success. Thank you both for everything you have provided for me and for always supporting my love of learning — you are an inspiration. I also want to take this opportunity to thank my best friend and husband-to-be for all of his patience, love, kindness and understanding. Thank you to all of my friends and family for being incredibly supportive throughout this journey. I could not have reached this point without all of you.
I offer my thanks to all of the past and present students and researchers at the Imaging Informatics Lab and the Seaman Family MR Research Centre. I have learned so much from all of you about MR imaging, programming and fundamental research concepts. I have enjoyed working with and getting to know all of you. I would especially like to thank Danny Ramotowski, Mark Simpson and Brian O’Brien for their technical assistance and help in accessing data as well as Pamela Wilkinson and Joanne Morgan for their help with travel claims, paperwork and always making sure I got paid.
I would like to thank my supervisory committee, including Brad Goodyear and
Gary Margrave, as well as Michael Lamoureux for their explanations of concepts from MR image acquisition to time-frequency analysis, and for providing direction and feedback about my research. I would like to acknowledge the help of Robert
Stockwell for answering my questions about his DOST work and for his contributions v to my own extensions of his work. I would like to extend my sincere thanks to
Gregory Cairncross, Gloria Rold´an, Paula de Robles and Daniel Adler for all of their hard work with the GBM project. They have been invaluable in the design and execution of this project. Without their hard work, support and encouragement, this part of my research project would not have been possible.
I give my thanks to all of my past supervisors and teachers. Thank you to my professors at the University of Regina for inspiring my love of physics, mathematics and computer science. Thank you to my past academic and employment supervi- sors, especially Alex MacKay at UBC for giving me my first opportunity to pursue independent research and for introducing me to MR.
I wish to recognize the support I have received from the following funding agen- cies: the Natural Sciences and Engineering Research Council of Canada for the
Alexander Graham Bell Canada Graduate Scholarship as well as the NSERC Post- graduate Scholarship; L’Or´eal Canada and the Canadian Commission for UNESCO for the Women in Science Mentor Fellowship; the Alberta Government Informatics
Circle of Research Excellence for the Graduate Student Scholarship; and the Al- berta Heritage Foundation for Medical Research for the Studentship Award. I would also like to acknowledge the financial support I have received from the University of Calgary through the Dean’s Research Excellence Award, the Dean’s Entrance
Scholarship and the Graduate Travel Award as well as the financial support from my supervisor, Ross Mitchell.
I would like to thank the following organizations that have allowed me to share my work with young people and help inspire the next generation of scientists and engineering: the Actua mentoring program for allowing me to travel to Saskatoon
vi and Regina to work with the SCI-FI and EYES science camps; the Women in Science and Engineering group at the University of Calgary for inviting me to participate with groups such as the Calgary Inner City Resource Centre’s “Saturday All Stars” program; the SCIberMENTOR email mentoring program; and the Telus World of
Science for inviting me to speak at the 2008 Beauty and Brains Conference.
Finally, I offer my sincere thanks and gratitude to my supervisor, Ross Mitchell.
Thank you for all of the hard work you have put in over the last four and a half years.
Your enthusiasm and dedication to medical imaging informatics is what inspired me to come to Calgary and pursue my research project. Thank you for having faith in me to pursue my own research direction while providing guidance, support and invaluable suggestions for all of the research presented in this thesis. All of the work I have published throughout my research program has benefited from your revisions and corrections. The hard work and long hours you have put in to secure funding through grants and industrial partnerships have given me the opportunity to travel internationally to present my work and network with and learn from leading scientists around the world.
While this is by no means a complete list, I would like to thank all of those people who have helped and supporting me in reaching this goal. Individual contributions to each publication are outlined in detail at the beginning of each chapter.
vii Table of Contents
Abstract ...... iii Acknowledgements ...... v Table of Contents ...... viii List of Tables ...... xii List of Figures ...... xiii List of Abbreviations ...... xxiv
CHAPTER 1: INTRODUCTION ...... 1 1.1 Thesis Objectives...... 1 1.2 Thesis Structure ...... 2 1.3 Significance and Contribution ...... 4
CHAPTER 2: INTRODUCTION TO SPACE-FREQUENCY ANALYSIS . 6 2.1 Fourier Transform ...... 6 2.2 Windowed Fourier Transforms ...... 9 2.2.1 Gabor Transform and Gabor Filters ...... 12 2.3 Wavelets ...... 18 2.3.1 Continuous 1D Wavelet Transform ...... 18 2.3.2 Discrete 1D Wavelet Transform ...... 19 2.3.3 Discrete 2D Wavelet Transform ...... 22 2.4 Curvelets...... 23 2.5 S-Transform ...... 26 2.5.1 1D S-Transform ...... 26 2.5.2 2D S-Transform ...... 29 2.5.3 S-Transform Inversion ...... 30 2.5.4 The Polar S-Transform ...... 31 2.5.5 Discrete Orthonormal S-Transform...... 33
CHAPTER 3: INTRODUCTION TO MAGNETIC RESONANCE IMAG- ING...... 37 3.1 MR Physics ...... 37 3.2 Generating an Image ...... 43 3.2.1 Image Reconstruction ...... 44 3.2.2 Neurological MR Imaging...... 45 3.3 MR Sequences and Contrast Mechanisms ...... 47 3.3.1 T2 Weighting ...... 47 3.3.2 T1 Weighting ...... 47
viii 3.3.3 Proton Density Weighting ...... 50 3.3.4 Contrast Enhancement ...... 50
CHAPTER 4: APPLICATIONS OF SPACE-FREQUENCY ANALYSIS TO MEDICAL IMAGE PROCESSING ...... 53 4.1 Filtering ...... 53 4.1.1 Image-Space Filtering ...... 53 4.1.2 Frequency-Domain Filtering ...... 56 4.1.3 Space-Frequency Filtering ...... 58 4.2 Texture Analysis ...... 63 4.2.1 Statistical Methods ...... 63 4.2.2 Multiscale Texture Features ...... 65 4.3 Classification ...... 69 4.3.1 Evaluating Classifier Performance ...... 71
CHAPTER 5: THE CIRCULAR S-TRANSFORM ...... 76 5.1 Introduction...... 76 5.1.1 Limitations of the pST ...... 77 5.1.2 Objectives ...... 78 5.1.3 Theory ...... 78 5.1.4 The Circular S-Transform ...... 81 5.2 Methods ...... 86 5.2.1 Preliminary Analysis on an MS Image ...... 86 5.2.2 Further Analysis ...... 86 5.3 Results ...... 88 5.3.1 Preliminary Results ...... 88 5.3.2 Further Results ...... 88 5.4 Conclusions ...... 92
CHAPTER 6: PROPERTIES AND PERFORMANCE OF THE CST . . . . . 95 6.1 Introduction...... 95 6.1.1 Phase Shift...... 96 6.1.2 Localizing Windows ...... 96 6.1.3 Smoothing ...... 98 6.1.4 Geometric Transformations ...... 99 6.1.5 Image Non-Uniformity...... 99 6.1.6 Inversion ...... 100 6.2 Methods ...... 100 6.2.1 Phase Shift...... 101 6.2.2 Localizing Windows ...... 102
ix 6.2.3 Visual Analysis of Local Spectra ...... 102 6.2.4 Frequency Response ...... 102 6.2.5 Smoothing ...... 103 6.2.6 Effect of Geometric Transformations ...... 103 6.2.7 Effect of Image Non-Uniformity ...... 105 6.2.8 Inversion ...... 106 6.3 Results ...... 108 6.3.1 Phase Shift...... 108 6.3.2 Localizing Windows ...... 109 6.3.3 Visual Analysis of Local Spectra ...... 112 6.3.4 Frequency Response ...... 118 6.3.5 Smoothing ...... 119 6.3.6 Effect of Geometric Transformations ...... 123 6.3.7 Effect of Image Non-Uniformity ...... 126 6.3.8 Inversion ...... 126 6.4 Discussion ...... 129 6.5 Conclusions ...... 133
CHAPTER 7: LESION EVOLUTION IN MULTIPLE SCLEROSIS US- ING THE CIRCULAR S-TRANSFORM ...... 134 7.1 Introduction...... 135 7.1.1 S-Transform Texture Analysis ...... 135 7.1.2 Clinical Application: Multiple Sclerosis ...... 136 7.2 Materials and Methods ...... 139 7.3 Results ...... 144 7.4 Discussion ...... 150 7.5 Conclusion ...... 154
CHAPTER 8: TEXTURE ANALYSIS USING THE 2D DISCRETE OR- THONORMAL S-TRANSFORM ...... 156 8.1 Background ...... 157 8.2 Theory ...... 160 8.2.1 1D Discrete Orthonormal S-Transform ...... 160 8.2.2 2D Discrete Orthonormal S-Transform ...... 166 8.2.3 Pixel-wise Local Spatial Frequency Description ...... 170 8.2.4 Primary Frequency Component ...... 172 8.2.5 Effect of Spatial Transformations on the DOST ...... 173 8.2.6 Rotationally Invariant DOST Features ...... 174 8.3 Methods ...... 176 8.3.1 Transform Characterization...... 178
x 8.3.2 Response to Noise ...... 179 8.3.3 Classification Experiment...... 180 8.4 Results ...... 184 8.5 Discussion and Conclusions ...... 188
CHAPTER 9: TEXTURE ANALYSIS OF GLIOBLASTOMA USING THE PST, CST AND DOST ...... 189 9.1 Introduction...... 190 9.2 Methods ...... 192 9.2.1 Patients ...... 192 9.2.2 DNA Samples ...... 192 9.2.3 Texture Analysis ...... 193 9.3 Results ...... 196 9.3.1 Qualitative Texture Analysis ...... 198 9.3.2 Quantitative Texture Analysis ...... 200 9.3.3 Texture-Based MGMT Prediction...... 205 9.4 Discussion ...... 207
CHAPTER 10: CONCLUSIONS ...... 209 10.1 Summary ...... 209 10.2 Future Work ...... 211 10.2.1 cST Image Filtering ...... 212 10.2.2 Application of the DOST to Texture Analysis of Multiple Sclerosis ...... 217 10.2.3 Correlation of Texture Features with Survival in GBM . . . . . 221
References ...... 227
APPENDIX A: REPRINT PERMISSION ...... 248
xi List of Tables
2.1 Low-pass (g) and high-pass (h) filters applied to the rows and columns of an image to generate horizontal (H), vertical (V), and diagonal (D) detail and approximation (A) coefficients...... 23
5.1 The minimum ROI sizes required to reach stable LFE values...... 91 5.2 P-values for the difference between pST and cST LFE measures for three bandwidths (columns 2–4), along with the correlation coeffi- cients (column 5). The pST and cST give similar LFE values, partic- ularly for large ROIs and bandwidths that exclude very low frequen- cies. The spectra are highly correlated...... 92
6.1 L1 norms for inverted images (average over 5 texture images)...... 131 7.1 New enhancing lesions that occurred over the course of 9 months of serial MR scanning. Note that month 0 corresponds to the start of treatment. Lesions were identified on T1 post-contrast images...... 140
8.1 Misclassifications (/100) of the Primary Frequency Component (Ori- ented Diagonally) ...... 187 8.2 Classification Accuracy for Invariant Wavelet (db4) and Invariant DOST on 9 Brodatz Texture Images ...... 188
9.1 Differentiation between MGMT promoter methylation status using qualitative texture features assessed visually on T1-post contrast and T2-weighted MR images...... 198 9.2 Classification results using linear discriminant analysis for qualitative and quantitative texture features...... 205 9.3 Classification results using quadratic discriminant analysis for quali- tative and quantitative texture features...... 205
10.1 Characteristics of best/worst survival groups...... 222 10.2 Texture features found to be significantly different between patients who survived for the longest amount of time after diagnosis (best, n = 15) and those who survived the shortest (worst, n = 15). These features were used in the Cox regression analysis...... 223 10.3 Variable bin cutoffs...... 223
xii List of Figures
2.1 Tilings of the time-frequency plane for (a) optimal time localization and no frequency localization (Dirac basis), (b) optimal frequency localization and no time localization (Fourier basis), (c) uniform par- titioning obtained with a constant width window and (d) dyadic par- titioning. Figures adapted from [1,2]...... 10 2.2 A signal consisting of a low-frequency sine wave (0.068 Hz), a quadratic chirp increasing in frequency from 0.039 to 0.39 Hz and a pulse from 154–166 s (bottom) and its FT (left). The STFT with a wide tem- poral window resolves the low frequency components well, but misses the pulse entirely...... 13 2.3 As the STFT window width decreases, the pulse becomes visible, but the time resolution does not allow for precise identification of when the pulse starts and ends. The frequency resolution of the remaining components is not as good as in Fig. 2.2...... 14 2.4 With a narrow temporal window, the STFT resolves the location of the pulse well, at the expense of the degrading frequency resolution. . 15 2.5 Sampling of the time-frequency plane for the GT. a and b determine the distance of the samples in the time and frequency directions, respectively...... 16 2.6 Dyadic tiling of the time-scale domain of the DWT. The width of each partition decreases by a factor of 2 as the scale decreases and the scale bandwidth doubles by a factor of two...... 20 2.7 DWT decomposition of a signal x[n] with low pass filters g[n] and high pass filters h[h]. The symbol ↓ 2 represents downsampling by a factor of two by discarding every second sample...... 21 2.8 Three levels of a 2D DWT (levels are denoted by the subscripts). Av- erage coefficients are denoted by “A” and detail coefficients represent: H: horizontal, D:diagonal, V:vertical...... 24 2.9 (a) The frequency domain tiling of the curvelet transform into polar “wedges”. (b) The Cartesian tiling based on concentric squares often used in practice when calculating the curvelet transform...... 25 2.10 The ST of the signal from Fig. 2.2 provides good resolution of both time and frequency, allowing one to identify all of the spectral com- ponents...... 28 2.11 Real (solid) and imaginary (dashed) portions of the DOST basis func- tions, defined in (2.52), for orders p = 0 to 5 and N = 100. This figure was intended to replicate Fig. 2 from [3]...... 35
xiii 2.12 The DOST of the signal from Fig. 2.2. The signal is plotted on the bottom and its FT on the left side. Note that the DOST coefficients are replicated to fill the N × N domain; in actuality the DOST con- tains only N coefficients for a N-point signal...... 36 3.1 The axis of rotation of a nuclear spin, μ , precesses about B0 at an angular frequency ω ∝ B0, known as the Larmor frequency...... 38 3.2 A static B0 field induces a net magnetization M along the z-axis. An applied B1 field along the x-axis rotates M into the transverse plane. The strength and duration of B1 determine the angle of M with respect to the z-axis, α, known as the flip angle...... 39 3.3 (a) Simulated regrowth of longitudinal magnetization over t =0–2500 ms with T1 = 500 ms. (b) Simulated decay of transverse magnetiza- tion over t =0–2500 ms with T2 = 500 ms...... 41 3.4 A diagram of acquisition space for MR imaging, known as k-space. max max The maximum frequency in k-space is kx = NxΔx/2 and ky = NyΔy/2. The center frequency (kx,ky)=(0, 0) is known as the DC component. The spacing between samples in k-space is given by Δkx =1/F OVx and Δky =1/F OVy. In a traditional acquisition scheme, one line of k-space is acquired each TR...... 46 3.5 Illustration of the three most common MR imaging planes: axial, sagittal and coronal...... 48 3.6 An example of a normal PD-weighted axial MR image. R and L refer to the right and left sides of the patient, respectively and A and P to anterior (front) and posterior (back), respectively...... 49 3.7 MR images of a MS patient obtained using a spin-echo sequence with the following weightings (TR/TE): (a) T2 (2716/80 ms), (b) T1 (650/8 ms) and (c) PD (2716/30 ms). Note the good contrast between lesions and surrounding white matter in (a). Examples of MS lesions are indicated by white arrows...... 51 3.8 Examples of T1-weighted images taken (a) before and (b) after the injection of gadolinium contrast agent (Gd-DTPA). Note the improve- ment in the contrast between the large brain tumor (arrow) and the normal brain tissue in (b) along with the detail evident within the tumor itself...... 52
4.1 (a) A 256 × 256 pixel MR image. (b) The same image after blurring with a Gaussian g of width σ =1.28 pixels and (c) after applying g with σ =2.56 pixels...... 55
xiv 4.2 Anisotropic diffusion filtering of the image shown in Fig. 4.1(a) with (a) 10 iterations at κ = 28, (b) 50 iterations at κ = 28 and (c) 10 iterations at κ =10, 000...... 57 4.3 Examples of low-pass, high-pass and band-pass filters...... 59 4.4 (a) A signal (left column) that contains a sinuosoid whose frequency increases with time along with its FT (right column). The results after applying the (b) low-pass, (c) high-pass and (d) band-pass filters from Fig. 4.3. The filters are shown as dotted lines in the right column (multiplied by 30 for display)...... 60 4.5 (a) An example of a signal that contains a low-frequency component for the first half of the signal, a mid-range frequency for the second half and a high-frequency burst that occurs from 0.12 to 0.27 s (bot- tom) and its ST (top). (b) The result after applying a multiplicative box filter to the ST (with value zero in the area indicated by the white box and one elsewhere) and inverting...... 62 4.6 Example of a 4 × 4 image with four grey-level values ranging from 0 to 3 (left). The structure of the grey-level spatial dependence matrix (right). Figure adapted from [4]...... 64 4.7 (a) Three levels (n = 3) of a wavelet decomposition of an image. Average coefficients are denoted by “A” and detail coefficients: H: horizontal, V: vertical, D: diagonal. (b) For rotationally-invariant coefficients, V & H coefficients are averaged to obtain Wn for each level and the average coefficients, while D coefficients (shaded grey in (a)) are omitted. (c) The average of each Wn is calculated to obtain the rotationally-invariant coefficients...... 68 4.8 The 2D feature space of two features denoted f1 and f2. (a) An ex- ample of a linear discriminant function that separates the two classes and (b) a quadratic discriminant function. (c) In k-NN classification an unknown sample (denoted by the circle with question mark) is assigned to the majority class of its k nearest neighbors. If k = 2 the object is assigned to the class represented by the triangles; if k =5 it is assigned to the class represented by squares...... 70 4.9 Example of a contingency table showing how sensitivity, specificity, PPV and NPV are calculated...... 73 4.10 A ROC plot showing the performance of various classifiers. The best possible classifier would be in the upper left corner. The dashed line indicates performance of a random guess (the line of no discrimina- tion). The predictions of classifiers worse than a random guess can be inverted to improve their performance (dashed black arrow)...... 75
xv 5.1 A flow diagram illustrating the process of calculating the pST on an ROI (shown as a box on the image)...... 80 5.2 A flow diagram illustrating the process of calculating the cST...... 84 5.3 The 3D cST (x, y, k) can be viewed in two ways: as “frequency maps” that show the power of a particular k at all spatial locations, cST (∗, ∗,k) or by taking projections at a particular (x, y) location, cST (x, y, ∗). . . 85 5.4 Average local spectra for a 5 × 5 region in (a) a MS lesion and (b) NAWM. The shaded area indicates the region where LFE is calculated (0.73 to 3.2 cm−1). By allowing the entire image to be transformed, the cST is able to improve the resolution of the spectra by a factor of 8 over the 64 × 64 pST analysis and by a factor of 16 over the 32 × 32 analysis. The improved resolution appears to resolve and emphasize peaks not obvious in an ROI analysis...... 89 5.5 The average LFE values from the pST and cST, averaged over a 5x5 region in an MS lesion and NAWM. The numbers in brackets indicate the size of the ROI analyzed...... 90 5.6 The spectrum of a point in an MR image computed using the cST at the full image FOV as well as with the pST at varying FOV sizes (128 × 128, 64 × 64 and 32 × 32)...... 90 5.7 Effect of ROI size on LFE using the pST for one patient. The ∗ and † indicate values that are not significantly different (p>0.05). Error bars are standard deviation within a 5×5 pixel region...... 91 5.8 Examples of spectra obtained from the pST (gray) and cST (black) for FOVs: (a) 32 × 32, (b) 64 × 64 and (c) 128 × 128...... 93 5.9 The pST takes longer to calculate (time ∝ N 3.8) than the cST (time ∝ N 2.6−2.7) for various N × N image sizes...... 94
6.1 (a) The maximum frequency measured when applying circularly sym- metric windows (grey) up to the Nyquist frequency (nmax = N/2) is kmax =1/(2Δx) or 0.50 if Δx = 1. The resulting spectra will contain N/2 points. While the corner frequencies are not directly sampled, their contribution is included in lower frequencies due to window overlap. (b)√ The maximum frequency√ of a localizing window (grey) if nmax = N/ 2iskmax =1/( 2Δx) or 0.71 if Δx =1...... 98 6.2 (a) A T2-weighted MR image with a FOV = 22 cm and matrix size = 256 × 256. The same image is shown cropped to FOV= 11 cm, matrix size = 128 × 128 (b) and FOV= 5.5 cm, matrix size = 64 × 64.104
xvi 6.3 (a) An example of a Brodatz texture image used to perform the non- uniformity experiment. The effect of (b) low, (c) medium and (d) high levels of non-uniformity are modeled by multiplying the image in (a) by a Gaussian of standard deviation 3N/2,N and N/2 pixels, respectively...... 107 6.4 (a) A 1D signal consisting of a low-frequency component (ξ=0.04) for the first half, a mid-range frequency (ξ=0.16) for the second half and a high-frequency burst (ξ=0.38) at t=20. (b) The magnitude of the ST, S(τ,ξ). (c) and (d) show the real portion and phase maps of S(τ,ξ), respectively. (e) and (f) show the real and phase portions, respectively, of S (τ,ξ)=S(τ,ξ)p(τ,ξ) where p(τ,ξ)=e2πiτξ is a phase rotation that rotates the “ripples” from the vertical to the horizontal orientation...... 110 6.5 (a) The magnitude of the ST (shown in Fig. 6.4) after averaging the frequency components for each point to a dyadic sampling grid, without applying the phase rotation. Destructive phase interference causes the relevant information to disappear; only the edge effect is present in the frequency-averaged domain. (b) The magnitude of the frequency-averaged domain after applying the phase ramp; the relevant information is retained...... 111 6.6 (a) The circularly symmetric Gaussian window provides a more even spectral weighting than (b) the corresponding pST window, inte- grated over all values of θ, (128×128), F =20. (c) The circularly symmetric window in the spatial domain is a sinc-like function...... 113 6.7 (a) A profile of the cST window and Gabor filter in the spectral domain for F =20,N = 128. Note that the cST window tapers to zero more quickly than the Gabor windows. These windows have been scaled to the same maximum amplitude for comparison. (b) The cST window is slightly broader than the Gabor filter in the spatial domain, meaning that the ringing will be spread over a slightly larger area. The effect of the ringing can be reduced by smoothing the amplitude frequency maps...... 114 6.8 (a) Test pattern (256×256) with four frequencies of wavenumber (hor- izontal, vertical): (0,4), (8,10), (60,0) and (20,20), starting in the up- per left, clockwise. (b) The resulting cST spectra (power, magnitude squared) from the centre of each region (UL: upper left, UR: upper right, LL: lower left, LR: lower right)...... 115
xvii 6.9 The squared magnitude (power) of frequency maps before (top) and after smoothing (middle) with a Gaussian kernel (σ =1/k) along with the corresponding pST power maps (bottom) for F = 4, 13, 28 and60...... 116 6.10 (a) A synthetic image consisting of two sinusoids added together: (kx,ky) = (0.16, 0) and (0, 0.31) where N=128. The spectra from the centre of the image are calculated using the cST, smoothed with σ =1.5/k (solid black); the invariant Gabor filter evaluated at F = √ √ 2 N/2,N/2/ 2.5,N/2/ 2.5 ,...; and the invariant wavelet method at 7 levels of decomposition as in [5]. Note that the amplitude of the wavelet coefficients have been divided by four for display purposes.
The Gabor and wavelet spectra contain only 7 points (log2 N) while thecSThas64points...... 117 6.11 Frequency maps for the delta function at F = 6 (top) and F =12 (bottom). The left column shows the unsmoothed data. The next two columns show the results after smoothing with a Gaussian of standard deviation σ = s/k, where s = 1 (middle) and s =1.5 (right). The rightmost column shows the Gabor-filtered response...... 118 6.12 (a) The local spatial frequency spectrum of a delta function using the cST with various levels of smoothing (σ = s/k). (b) The square root of the spectra shows that the amplitude increases with k2 for the unsmoothed case. The smoothed spectra deviate from this rela- tionship. The shaded areas represent the “corner” frequencies above the Nyquist frequency that may be unreliable in analysis due to the drop-off in spectral power...... 120 6.13 An image from the Brodatz texture library [6] and the resulting fre- quency maps for wavenumbers: F = 5, 20, 40 and 80 for the cST with no smoothing (left), smoothing with σ =1.5 (middle) and the magnitude Gabor filter response (right)...... 121 6.14 A cross-section of the F = 12 frequency map for the delta response. While smoothing with s = 1 leaves some residual artifact, a value of s =1.5 results in a smooth spatial distribution and removes the ringing caused by the localizing window...... 122 6.15 The local spectra at the centre of a T2-weighted MR image at three different scales: 256 × 256 (original) and cropped to 128 × 128 and 64 × 64. The differences at low frequency (below 1 cycle/cm) arise due to anatomy cropped out of the field-of-view (FOV) as well as due to truncation artifact...... 124
xviii 6.16 The L1 norm between the spectrum of a pixel of a 256 × 256 T2- weighted MR image and the corresponding spectrum after shifting the image by -6 to +6 cm in steps of 2 cm...... 125 6.17 The effect of image non-uniformity on cST spectra at a location 20 pixels away from the center of the image horizontally and vertically (radial distance = 28 pixels). The non-uniformity field is modelled as a multiplicative Gaussian function of standard deviation 3N/2, N, N/2 to represent low, medium and high levels of non-uniformity, respectively...... 127 6.18 (a) A synthetic signal containing a sinusoid of increasing frequency: f(t) = sin(2π(t/6)t/N) where t = 0 to 255 and N = 256. (b) A contour plot of the magnitude ST for the positive frequencies. (c) In- version of the ST along the frequency axis by directly implementing (2.43). (d) The difference between the reconstructed and original sig- nals shows that a large low-frequency error is apparent in the residual as well as high-frequency artifacts. (e) The reconstructed signal after applying AN according to (2.44) and (2.45) and using the traditional time-axis inversion according to (2.42) are shown as one line since they are indistinguishable. (f) The residuals show that frequency- axis inversion after correction (black) has maximum errors on the order of 10−14, which is comparable to that of time-axis inversion (gray) with maximum errors on the order of 10−15...... 128 6.19 Difference between original image and reconstructed using equation (6.7)...... 129 6.20 Images generated by applying a box-shaped filter G(x, y, F ) to the cST of the synthetic image in Fig. 6.8(a) and inverting. G(x, y, F ) consists of ones from (a)F=10to40and(b)F=1to10and40to 120, and zero elsewhere...... 130
7.1 The results of the NAWM segmentation, shown in red, overlaid on the month-1 PD-weighted image A. The lesion is shown in blue and the peripheral region, denoted as the lesion border, in black. The dashed yellow boxes show the 32× 32 ROIs used to get a representative sample of NAWM for the pST calculations...... 142 7.2 The average local spectra within (a) an enhancing lesion, (b) the lesion border, and (c) NAWM (month 1 of image A), using the tra- ditional pST technique with a 32 × 32 ROI (squares) and our new method with a 428 × 428 pixel ROI (circles). The shaded region shows the LFE range of 0.8 cm−1 ≤ k ≤ 3.6cm−1...... 145
xix 7.3 The magnitude spectra for images A (a,b), B (c,d) and C (e,f) using the traditional pST method (left column) and our new method (right column). The axes are zoomed in to show the low frequency range. The errorbars represent the standard error of the mean. Note the im- provement in spectral resolution using our method, and the improved distinction between lesion and border regions...... 147 7.4 (a) The T2-weighted image of patient #1 at month 1 when lesion A in the left hemisphere enhances. The red square shows the loca- tion of the 32 × 32 pixel ROI used for closer examination. (b) The time-evolution of T2 signal intensity within the ROI surrounding the enhancing lesion. (c) LFE maps generated by applying our new trans- form to the T2-weighted images, filtering with the band-pass filter G(x, y, k) and inverting...... 148 7.5 The average LFE (left column) and T2 (right column) intensity changes for lesions A (a,b), B (c,d) and C (e,f) that enhance in months 1, -2, -2, respectively, the lesion borders and NAWM. LFE signal is calcu- lated by filtering our new transform to isolate only signal components within the LFE bandwidth. Error bars represent standard error of the mean...... 149 7.6 An example of the raw cST of image A at month 1 (k =3cm−1). The × indicates the location of the lesion. Areas in the brain that contain edges and other structures at the corresponding spatial scale appear bright...... 151 7.7 Image A at month 1, filtered to only retain the frequencies in the LFE bandwidth. Note that while areas corresponding to lesion activity are highlighted using this technique (white arrow), borders between brain tissue and CSF are also highlighted. This is likely due to the shape of the window and filter used, but may help to separate demylinating lesions from CSF and transient damage...... 154
8.1 Spectral partitioning of the 1D-DOST for seven orders (N = 128). The scaled and shifted rect functions are the FTs of the sinc basis functions defined in (2.52)...... 161 8.2 (a) A modulated sinusoid time-series from [3]. (b) The DOST calcu- lated using the basis functions in (2.52) and (c) by partitioning the frequency domain, according to (8.4). (d) The full, redundant ST from (2.36)...... 164
xx 8.3 (a) A signal from a functional MR imaging experiment, sampled at 3.33Hz. The plot shows the change in signal corresponding to five repetitions of a task 12s in duration (subject moving their fingers), followed by a 24s rest period. (b) The ST calculated according to (2.36). (c) The DOST of the signal...... 165 8.4 Partitioning of (a) the DWT and (b) the DOST for 6 orders. The squares indicate the sub-images for each order. Both transforms use a dyadic sampling scheme but provide different information about the frequency content of the image. The DWT gives horizontal, vertical and diagonal “detail” coefficients for each order, while the DOST pro- vides information about the voice wavenumbers (νx,νy) that contain a bandwidth of 2px−1 × 2py−1 frequencies...... 168 8.5 (a) A mosaic of four texture images: straw, wood, sand and grass. (b) The 2D-DOST of the mosaic. (c) The DWT of the image using the db4 wavelet at five levels of decomposition. Note that the square root of the amplitude of both the DOST and DWT coefficients is shown for display purposes...... 171 8.6 The local DOST spectra of: (a) straw, (b) wood, (c) sand and (d) grass obtained from the center pixel of each region in Fig. 8.5a...... 175 8.7 (a) The wavelet decomposition for N = 8 at three levels of decom- position. Invariant wavelet coefficients are calculated by averaging horizontal and vertical coefficients at each level (marked with the same letter: A=level 0, B=level 1, etc.). The diagonal channels are excluded from the feature extraction. (b) The local frequency domain generated from the 2D-DOST for N = 8. Features marked with the same letter are averaged together to get an invariant DOST spectrum; a similar approach is taken of excluding the diagonal elements, where px = py. For a given N, the DOST approach provides more texture features (A to J = 10) than the wavelet approach (A to D = 4). . . . . 177 8.8 The nine Brodatz texture images used for texture analysis experi- ments...... 181 8.9 (a) The 2D-DOST of a delta function and (b) its local domain at the central point of the image (128,128)...... 183 8.10 (a) The calculated increase in memory required to store the ST and the DOST of a real N × N image. The ST requires N 4 floating point values while the DOST requires only N 2 (double for complex images). (b) The measured time to compute the ST and DOST. We were unable to compute the ST of images larger than 128 × 128 since the memory requirements became too large...... 185
xxi 8.11 (a) The SNR (dB) of the main frequency peak of the local DOST do- main at the centre of a 256 × 256 image containing a single horizontally- oriented frequency component. (b) The SNR of the main peak when examining an image with a horizontally-and vertically-oriented single frequency component. Note that the SNR of these peaks drops more quickly with increasing noise and increasing frequency than when only one component is present...... 186
9.1 Examples of T2 (a, b), FLAIR (c, d) and T1-post contrast (e,f) MR images of an unmethylated (left column) and a methylated (right column) GBM...... 197 9.2 Examples of the texture features assessed visually on T1-post contrast images. (a) sharp and (b) undefined tumor borders; (c) ring and (d) nodular enhancement; (e) presence of a cyst (white arrow)...... 199 9.3 Examples of T2-weighted images assessed visually that were judged to have (a) homogeneous and (b) heterogeneous signal...... 200 9.4 (a) Mean log-transformed cST spectra from the ROIs of T2-weighted images for unmethylated (n = 28) and methylated (n = 31) tumors. Error bars are standard error of the mean. (b) The mean group dif- ferences and 95% CIs indicate that the spectral power of features highlighted with an asterisk are significantly higher in the unmethy- lated group than the methylated group...... 201 9.5 (a) Mean log-transformed invariant pST spectra from the ROIs of T2- weighted images. Error bars are the standard error of the mean. (b) The mean differences between methylated and unmethylated groups. Dashed lines indicate the 95% CI. The asterisks indicate where the 95% CI does not include zero...... 202 9.6 (a) Mean log-transformed invariant DOST features from the ROIs of T2-weighted images. Error bars are the standard error of the mean. (b) The mean differences between methylated and unmethy- lated groups. Error bars are the 95% CI. Asterisks indicate where the 95% CI does not include zero...... 203 9.7 (a) Mean log-transformed invariant DOST features from the ROIs of FLAIR-weighted images. Error bars are the standard error of the mean. (b) The mean differences between methylated and unmethy- lated groups. Error bars are the 95% CI. The asterisk indicates where the 95% CI does not include zero...... 204
xxii 9.8 ROC analysis of various combinations of texture features using linear (black) and quadratic (grey) discriminant analysis. Filled symbols represent results when adding the ring enhancement variable. Clas- sification using the DOST features along with the ring enhancement variable produced the best sensitivity and specificity of all the various methods tested...... 206
10.1 The gradient image G(x, y) corresponding to the image shown in Fig. 10.3...... 214 10.2 Examples of sixth order Butterworth filters for values of k0 =0.3to 0.6...... 215 10.3 (a) An example of a T2-weighted MR image. The results after ap- plying the cST filter are shown using (b) C =0.003, (c) C =0.005, and (d) C =0.01...... 216 10.4 A 512× 512 MR image of a relapsing-remitting MS patient, along with two areas examined for texture: NAWM (dotted white) and a lesion (dotted black)...... 219 10.5 The local frequency maps derived from the DOST of the central 5× 5 regions of the (a) NAWM (solid white box from Fig. 10.4) and (b) the lesion (solid black box from Fig. 10.4). The local frequency maps indicate the magnitude of each frequency order (px,py) that contains the total contribution of spatial frequencies from 2px−1 to 2px (horizontal) and 2py−1 to 2py (vertical). Note that the negative frequencies are computed through Hermitian symmetry of the space- frequency domain. The lesion contains more low-frequency power (low orders px,py) than the NAWM, consistent with the results of [7]. 220 10.6 Cumulative survival function. The plot shows the percentage of pa- tients that survive with time after diagnosis. Those patients with high values of the T1 spectrum have a 70% lower risk of death than those with low values (P = 0.002)...... 224 10.7 Cumulative hazard function (the negative log of survival). The plot shows how the cumulative risk of death increases with time after diagnosis. Patients with low values of the T1 cST spectrum have a 3 times higher risk of death than those with low values (P = 0.002). . . . 225
xxiii List of Abbreviations
1D one-dimensional
2D two-dimensional
3D three-dimensional
4D four-dimensional
CI confidence interval
CSF cerebrospinal fluid cST circular S-transform
CT computed tomography
CWT continuous wavelet transform dB decibels db4 Daubechies 4-tap mother wavelet
DFT discrete Fourier transform
DNA deoxyribonucleic acid
DOST discrete orthonormal S-transform
DWT discrete wavelet transform
FFT fast Fourier transform
FLAIR fluid-attenuated inversion recovery fMRI functional magnetic resonance imaging
FOV field-of-view
FT Fourier transform
GBM glioblastoma
GT Gabor transform
xxiv LFE low frequency energy MGMT O6-methylguanine-DNA methyltransferase MR magnetic resonance MS multiple sclerosis MS-PCR methylation-specific polymerase chain reaction N3 nonparametric intensity nonuniformity normalization NAWM normal appearing white matter NMR nuclear magnetic resonance NPV negative predictive value PD proton density PPV positive predictive value pST polar S-transform RF radiofrequency RMS root mean squared ROC receiver operating characteristic ROI region of interest SNR signal to noise ratio ST S-transform STFT short time Fourier transform
T1 T1 relaxation time
T2 T2 relaxation time TE echo time TI inversion time TMZ temozolomide TR repetition time
xxv 1
Chapter 1
Introduction
1.1 Thesis Objectives
My PhD studies were focused on expanding current knowledge of space-frequency analysis of medical images. In particular, my work has focused on improving image analysis methods based on the S-transform (ST). ST-based techniques are desirable for medical imaging because: (a) they are closely related to the Fourier transform
(FT) (the basis of MR and computed tomography (CT) image reconstruction); and,
(b) they are invertible — that is, one can move between the image domain and the
ST domain without loss of information. This property allows sophisticated filtering and analysis of images.
This thesis describes several novel contributions resulting from my research.
These include: (1) new ways of reducing the amount of computational resources required to compute and store the ST; (2) new ways of extracting and using infor- mation from the ST for applications in medical image processing (such as filtering and texture analysis); and (3) the application of these ST-based techniques to help solve clinical problems, such as monitoring lesions in MS and quantifying texture differences in sub-types of brain cancer. Along the way, I tried to take advantage of novel aspects of the techniques I developed and show how these features can be used to perform new types of analysis, such as space-frequency image filtering. Finally,
I promoted the use of the ST in medical imaging to scientists and clinicians and 2 demonstrated that it is a robust tool for image and signal analysis.
1.2 Thesis Structure
The first three chapters of the thesis contain background information related to my PhD research. The first chapter introduces the objectives and structure of the thesis and outlines the scientific contributions described within. The second chapter provides a mathematical overview of transforms related to the ST, describes the motivation for choosing this project and details the significance and contribution of the work. The third chapter outlines the basic theory of MR imaging necessary to understand the application of my work to clinical problems. The fourth chapter describes how space-frequency analysis can be applied to medical imaging in terms of image filtering, texture analysis and classification. In the introductory chapters, limitations of current methods are discussed and further motivation for pursuing this research project are presented.
Chapters five through nine each describe a novel scientific contribution. Each of these chapters has been published, or submitted for publication, in a peer-reviewed journal. The fifth chapter examines why the pST requires such lengthy com- putation times and what effect different image ROI sizes has on ST-based texture measures. In response to these concerns, I present the introductory theory of a novel transform, known as the cST. I present a study examining the relationship between the new transform and the pST and illustrate some of the advantages of the new approach, including improved spectral resolution. In the sixth chapter,I describe the characteristics of the cST in more detail and outline some of its unique 3 properties, including a novel application of a newly proposed inversion method for the ST. Experiments are presented that attempt to characterize the performance of the cST in simulated and real-world conditions. The seventh chapter presents an application of the cST to images of patients with MS. This work illustrates a novel application of space-frequency analysis that is not possible with the pST — applying filters to the spectra to isolate spectral components thought to be involved in MS lesion progression.
The eighth chapter presents another novel method for medical image analysis known as the DOST. Motivated by new work published by Dr. Stockwell in 2007, I change direction in this chapter and adapt this discrete orthonormal transform for image texture analysis. The eighth chapter outlines the frequency-domain DOST algorithm, presents the 2D-DOST for the first time and outlines potential texture descriptors that can be derived from the DOST domain. I conduct experiments to characterize the performance of the DOST and to evaluate its response to noise. I also compare the classification power of features derived from the DOST features to those from a leading wavelet-based technique on a library of texture images.
The ninth chapter presents a clinical study examining the texture of a brain tumor known as GBM. Texture differences are sought that correlate with genetic alterations that predict tumor response to chemotherapy. I use all of the ST-based techniques presented in this thesis (the cST, pST and DOST) to perform texture analysis of GBM tumors. I perform blind classification of the tumors with regard to their genetic status using an analysis of entire image slices as well as small ROIs located within the tumors. Texture characteristics are also assessed visually by an expert. It is shown that the DOST of image ROIs, in combination with visually 4 assessed features, produces texture features that most accurately classify GBM pa- tients. It is also shown that texture features correlate with survival — an interesting and novel finding.
The tenth chapter concludes the thesis by summarizing the major research
findings and providing suggestions for future research directions.
1.3 Significance and Contribution
The work presented in this thesis has been published and presented in various forms.
The introduction to each chapter describes the publications, as well as any awards or patents earned based on the work. In total, the work presented in this thesis has culminated in 3 manuscripts published or submitted for publication in peer- reviewed scientific journals and one paper in a peer-reviewed conference proceeding;
4 abstracts published in peer-reviewed conference proceedings and one that has been accepted for presentation; 4 poster and 3 podium presentations; and numerous other presentations at seminars, rounds, invited talks, and journal clubs. In addition, the intellectual property based on the two novel transforms described in this thesis has been transferred to Calgary Scientific, Inc. and forms the foundation of two provisional patents. I have been awarded 5 investigator awards and travel stipends to present this work at Canadian and international conferences and have won awards for best poster and best podium presentations at local and international conferences.
The research presented in this thesis is a collaborative effort from a team of researchers at the Foothills Hospital, University of Calgary with contributions from others as well. I have been the lead researcher and the first author of all publica- 5 tions presented in this thesis. Dr. Ross Mitchell has provided support and guidance throughout the research project and is a co-author on all of the publications pre- sented in this thesis. Individual contributions to each project are outlined at the beginning of each chapter.
Please note that while I have tried to keep the notation in this thesis as consistent as possible, because I am incorporating sources from different disciplines that use differing notation, at times I have taken liberties with defining variables. For exam- ple, I refer to some continuous and discrete variables with the same symbol. While such definitions may not always be strictly correct in a mathematical sense, I have attempted to convey the information in this thesis as clearly and simply as possible.
My hope is to keep the information accessible to readers of varying backgrounds. 6
Chapter 2
Introduction to Space-Frequency Analysis
This chapter provides an introduction to Fourier theory and space-frequency anal- ysis. The most common approaches to multiscale analysis, based on windowed
Fourier transforms, wavelets and the S-transform, are introduced. Limitations of each method are discussed and the motivations for new techniques are introduced.
The curvelet transform is briefly mentioned in this chapter for completeness; how- ever, analysis using the curvelet transform is not pursued in this thesis. The com- parison of the ST to curvelets is left as future work.
2.1 Fourier Transform
The FT is a mathematical procedure that expresses a signal (or image) in terms of sines and cosines. Joseph Fourier first described the concept of representing a signal as the superposition of waves (harmonic analysis) in a decomposition known as a
Fourier series, for the purposes of solving the heat equation [8]. Fourier analysis is now used in a wide variety of areas including mathematics, science, engineering and medicine, for applications in signal processing, communications, astronomy, geology and optics, as well as being integral to the development of modern medical image reconstruction in modalities such as MR [9] and CT.
The FT of a signal f(t) is defined as: