Multiscale Total Variation and Multiscale Anisotropic Diffusion Algorithms for Image Denoising

TONY CHAN, YANG WANG, AND HAOMIN ZHOU

Abstract. As digital photography rapidly replaces traditional film photography as the medium of choice for all but a few devoted professionals, post-processing to enhance images, such as denoising, is increasingly an integral part of digital photography. In this paper we propose the multiscale total variation (MTV) and the multiscale anisotropic diffusion (MAD) algorithms for denoising. Both methods offer more flexibility than the classical TV method and the related anisotropic diffusion method. We discuss the algorithms and their implementation in detail. An advantage of the MTV and MAD methods is that an automatic stopping criterion can easily be implemented to prevent over-processing of an image. We also raise several mathematical questions.

1. Introduction

The advent of digital imaging technologies such as digital cameras, video recorders and medical imaging devices such as MRI has changed our way of life. These technologies continue to evolve: digital cameras and medical imaging devices continue to improve in resolution and performance. At the same time, consumers and patients have come to expect ever more in terms of image quality. Ironically, what we expect of these technologies can often be contradictory. For example, as we demand more resolution in images, more pixels must be packed into a sensor of a given size, which leads to more noise in the captured images. Thus, to meet these expectations, image denoising becomes an important challenge in digital photography and medical imaging.

Noise in images can present problems depending on the application. For medical images, noise can mask and blur important but subtle features; for photography, noise is a nuisance in terms of visual effect and can ruin an otherwise memorable shot.
Denoising of an image, however, poses challenges of its own. Most cameras have built-in denoising algorithms, but these often perform poorly, leaving soft edges and smeared-out details. Post-processing denoising can be more effective, but similar problems remain.

Key words and phrases. Denoising, total variation, wavelet TV, anisotropic diffusion, multiscale total variation, multiscale anisotropic diffusion, automatic stopping criterion.

This paper was completed while the first author was an IPA at the National Science Foundation. The views expressed in this paper are those of the authors, and do not necessarily reflect those of the NSF. The second author is supported in part by the National Science Foundation grant DMS-0813750. The third author is supported in part by the National Science Foundation grant DMS-0410062.

Denoising methods for monochromatic images can essentially be classified into four categories: neighborhood filters, frequency domain methods, variational PDE based methods and non-local methods. The most commonly used neighborhood filters include blurring filters, the median filter and its variations, and others such as the filters described in [32, 33]. Neighborhood filters are fast and easy to implement, and in some applications they can be effective, although in general their effectiveness is limited. In frequency domain methods a wavelet, DCT or other type of transform is first performed, and a filter is then applied to the transform coefficients. The most common frequency domain method is wavelet thresholding. It assumes that noise appears in the wavelet transform as small nonzero coefficients in the high frequency range, and sets those coefficients to zero. Wavelet thresholding is very fast and very effective in removing noise. It is still perhaps the best "quick and dirty" denoising scheme. However, it suffers from Gibbs oscillations at discontinuities.
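To make the thresholding step concrete, the following Python sketch applies one level of an orthonormal Haar transform to a 1-D signal and thresholds the detail coefficients. It is an illustration under stated assumptions, not the scheme of any cited reference: the Haar wavelet, the single decomposition level, the function names and the threshold parameter tau are all choices made here for brevity.

```python
import numpy as np


def haar_forward(x):
    """One level of the orthonormal Haar transform of an even-length 1-D signal."""
    x = np.asarray(x, dtype=float)
    approx = (x[0::2] + x[1::2]) / np.sqrt(2.0)  # low-frequency part
    detail = (x[0::2] - x[1::2]) / np.sqrt(2.0)  # high-frequency part
    return approx, detail


def haar_inverse(approx, detail):
    """Invert haar_forward exactly."""
    x = np.empty(2 * approx.size)
    x[0::2] = (approx + detail) / np.sqrt(2.0)
    x[1::2] = (approx - detail) / np.sqrt(2.0)
    return x


def hard_threshold(c, tau):
    """Set coefficients with magnitude at most tau to zero; keep the rest unchanged."""
    c = np.asarray(c, dtype=float)
    return np.where(np.abs(c) > tau, c, 0.0)


def soft_threshold(c, tau):
    """Set small coefficients to zero and shrink the rest toward zero by tau."""
    c = np.asarray(c, dtype=float)
    return np.sign(c) * np.maximum(np.abs(c) - tau, 0.0)


def denoise_one_level(x, tau, rule=hard_threshold):
    """Threshold only the detail (high-frequency) coefficients, then reconstruct."""
    approx, detail = haar_forward(x)
    return haar_inverse(approx, rule(detail, tau))
```

Passing rule=soft_threshold shrinks the surviving coefficients toward zero instead of keeping them unchanged.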
The Gibbs oscillations can be reduced, although not eliminated, by using soft wavelet thresholding [16, 17] and translation-invariant wavelet thresholding [13]. Other generalizations use curvelets [3] and framelets [7]. In particular, the method of iterating soft thresholded images using framelet representations in [7] is effective in removing the Gibbs oscillations.

Methods that are not PDE based and do not fit the description of the other two categories, including some statistical methods, can be classified as non-local denoising methods, see e.g. [2, 31, 20, 25]. These methods are typically slower, and some of them assume that certain statistical properties are known. Given the right images, non-local methods can yield excellent results. A particular kind of non-local method, developed in [2], works well for images with repeating patterns or large homogeneous areas, see also [20, ?, ?, ?]. In our tests on general images, however, the success is more limited.

In this paper we propose the multiscale total variation (MTV) algorithm for image denoising, which has its roots in the classical total variation (TV) scheme pioneered by Rudin, Osher and Fatemi [26]. Before discussing the MTV algorithm in detail, we first describe denoising based on variational principles and PDE techniques. We start with a standard noisy monochromatic image model

$$z(x) = u_0(x) + n(x), \tag{1.1}$$

where $z(x)$, $u_0(x)$ and $n(x)$ are real valued functions defined on $\Omega \subset \mathbb{R}^2$, and $\Omega$ is a bounded domain such as a rectangle. The function $u_0(x)$ denotes the underlying noise-free image, $z(x)$ the observed image, and $n(x)$ the noise. In our general model, we assume that $z(x)$, $u_0(x)$ and $n(x)$ are in some space of functions $F$, such as $L^2(\Omega)$ or $C^1(\Omega)$. With a variational PDE based denoising method, the denoised image is the minimizer of a certain energy functional $E(u)$.
Typically $E(u)$ can be written as

$$E(u) = D(u, z) + R(u), \tag{1.2}$$

where $D(u, z)$ denotes the "distance" between $u$ and the observed image $z$, and $R(u)$ is a regularization term that smooths out the image. The idea of using the variational method described here has been around for some time. In most cases the distance $D(u, z)$ is taken to be the $L^2$ distance $D(u, z) = \int_\Omega (u - z)^2$. Earlier efforts focused on least squares based regularization functionals $R(u)$ such as $\|\Delta u\|_2^2$, $\|\nabla u\|_2^2$ and others. While noise can be effectively removed, these regularization functionals penalize discontinuity, resulting in soft and smooth reconstructed images with subtle details lost. This is not acceptable in digital photography, as photographers often place a premium on sharpness.

The innovation of the total variation (TV) scheme by Rudin, Osher and Fatemi [26] is to set $R(u)$ to be the total variation $\int_\Omega |\nabla u|$ of $u$. Extensive studies have shown that the total variation regularizer does not penalize edges in $u$, and thus allows for sharper reconstructions, see e.g. [1, 6, 9, 15]. Among all the variational PDE based techniques, the TV minimization scheme offers one of the better combinations of noise removal and feature preservation. It is easy to show that the TV minimization scheme leads to solving the PDE

$$\nabla \cdot \left( \frac{\nabla u}{|\nabla u|} \right) - \lambda (u - z) = 0. \tag{1.3}$$

In practice, however, one introduces the time variable $t$ and solves for $u(x, t)$ by time-marching the equation

$$u_t = \nabla \cdot \left( \frac{\nabla u}{|\nabla u|} \right) - \lambda (u - z), \qquad u(x, 0) = z(x). \tag{1.4}$$

The end result $u(x, T)$, if $T$ is large enough, will have noise removed or reduced. In essence, the time-marching by (1.4) uses gradient flow to minimize the energy $E(u)$. An important attribute of the TV minimization scheme is that it takes the geometric information of the original image into account, in that it does not penalize edges. On the contrary, significant edges are sharpened. This is similar to the anisotropic diffusion methods, see [?].
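The time-marching in (1.4) can be sketched numerically as follows. This is a minimal explicit (forward Euler) discretization in Python, offered as an illustration rather than the implementation used in this paper. The following are assumptions of the sketch, not of the text: forward differences with replicated boundaries, a small constant eps added under the square root so that the division by $|\nabla u|$ is well defined, intensities scaled to roughly [0, 1], a fidelity term weighted as (lam / 2) times the squared $L^2$ distance so that its gradient is lam (u - z), and a fixed number of steps n_steps. All function names are hypothetical.

```python
import numpy as np


def _gradient(u):
    """Forward differences; the last column/row difference is zero (replicated edge)."""
    ux = np.diff(u, axis=1, append=u[:, -1:])
    uy = np.diff(u, axis=0, append=u[-1:, :])
    return ux, uy


def _divergence(px, py):
    """Backward differences, the negative adjoint of _gradient."""
    return np.diff(px, axis=1, prepend=0.0) + np.diff(py, axis=0, prepend=0.0)


def smoothed_energy(u, z, lam, eps=0.1):
    """Discrete smoothed TV of u plus (lam / 2) * squared L2 distance to z."""
    ux, uy = _gradient(u)
    tv = np.sum(np.sqrt(ux**2 + uy**2 + eps**2))
    return tv + 0.5 * lam * np.sum((u - z) ** 2)


def tv_flow_step(u, z, lam, dt, eps=0.1):
    """One explicit step of u_t = div(grad u / |grad u|) - lam * (u - z)."""
    ux, uy = _gradient(u)
    mag = np.sqrt(ux**2 + uy**2 + eps**2)  # regularized |grad u|
    curvature = _divergence(ux / mag, uy / mag)
    return u + dt * (curvature - lam * (u - z))


def tv_denoise(z, lam=0.1, dt=0.01, n_steps=100, eps=0.1):
    """March from u(x, 0) = z(x) for a fixed number of steps."""
    z = np.asarray(z, dtype=float)
    u = z.copy()
    for _ in range(n_steps):
        u = tv_flow_step(u, z, lam, dt, eps)
    return u
```

The step size dt must be small relative to eps for the explicit scheme to remain stable; the defaults are chosen conservatively. The fixed n_steps stands in for the stopping time $T$.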
On anisotropic diffusion, see also [28] and the references therein.

2. Multiscale Total Variational Method for Image Denoising

The TV minimization scheme has its own weaknesses. It is well known that when (2.4) is left running for too long, a denoised image will tend to become a cartoon-like piecewise constant image, wiping out all subtle details [22, 24]. With a gentler run, one may not remove enough noise. For optimal results it is important to have an automatic stopping criterion. This is difficult to do. Although there are some attempts in this direction [19, 23, 30], they assume that some properties of the noise (such as the variance) are known, which is not entirely realistic for natural color photos. A modified version of the TV denoising scheme based on wavelets was introduced in Chan and Zhou [12]. Using this scheme, Wang and Zhou [29] devised an automatic stopping criterion for the time-marching process (2.4). This criterion works surprisingly well in experiments, see [29] for a detailed discussion.

In this paper we propose two new methods based on the wavelet TV denoising scheme. The cartoon-like tendency in the standard TV method is a result of excessive diffusion of low frequency features in an image. The new methods, which we call the Multiscale Total Variation (MTV) and the Multiscale Anisotropic Diffusion (MAD) algorithms, concentrate the diffusion on high frequency features, where the noise resides. Our tests illustrate that the MTV and the MAD methods are highly effective. Compared to the standard TV or wavelet TV method, the two methods require fewer iterations to obtain comparable or better results. Figure ?? shows some comparisons.

To see how the MTV and MAD algorithms work, let $\{\psi_j : j \in I\}$ be an orthonormal or biorthonormal wavelet basis for $L^2(\Omega)$ such as those found in [14, 27].
