Multichannel Color Image Denoising Via Weighted Schatten P-Norm Minimization

Total Page:16

File Type:pdf, Size:1020Kb

Multichannel Color Image Denoising Via Weighted Schatten P-Norm Minimization Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20) Multichannel Color Image Denoising via Weighted Schatten p-norm Minimization Xinjian Huang , Bo Du∗ and Weiwei Liu∗ School of Computer Science, Institute of Artificial Intelligence and National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China h [email protected], [email protected], [email protected] Abstract the-art denoising methods are mainly based on sparse repre- sentation [Dabov et al., 2007b], low-rank approximation [Gu The R, G and B channels of a color image gen- et al., 2017; Xie et al., 2016], dictionary learning [Zhang and erally have different noise statistical properties or Aeron, 2016; Marsousi et al., 2014; Mairal et al., 2012], non- noise strengths. It is thus problematic to apply local self-similarity [Buades et al., 2005; Dong et al., 2013a; grayscale image denoising algorithms to color im- Hu et al., 2019] and neural networks [Liu et al., 2018; age denoising. In this paper, based on the non-local Zhang et al., 2017a; Zhang et al., 2017b; Zhang et al., 2018]. self-similarity of an image and the different noise Current methods for RGB color image denoising can be strength across each channel, we propose a Multi- categorized into three classes. The first kind involves apply- Channel Weighted Schatten p-Norm Minimization ing grayscale image denoising algorithms to each channel in (MCWSNM) model for RGB color image denois- a channel-wise manner. However, these methods ignore the ing. More specifically, considering a small local correlations between R, G and B channels, meaning that un- RGB patch in a noisy image, we first find its nonlo- satisfactory results may be obtained. The second method is cal similar cubic patches in a search window with to transform the RGB color image into other color spaces [D- an appropriate size. These similar cubic patches are abov et al., 2007a]. This transform, however, may change the then vectorized and grouped to construct a noisy noise distribution of the original observation data and intro- low-rank matrix, which can be recovered using the duce artifacts. The third type of method involves making full Schatten p-norm minimization framework. More- use of the correlation information across each channel and over, a weight matrix is introduced to balance each conduct the denoising task on R, G and B channels simulta- channel’s contribution to the final denoising result- neously [Zhang et al., 2017a]. s. The proposed MCWSNM can be solved via the Similar to the case of grayscale images, noise from each alternating direction method of multipliers. Con- channel can be generally, regarded as additive white Gaussian vergence property of the proposed method are also noise. However, the noise levels of each channel are diverse theoretically analyzed . Experiments conducted on due to camera sensor characteristics and the imagery environ- both synthetic and real noisy color image dataset- ment, such as fog, haze, illumination intensity, etc. Moreover, s demonstrate highly competitive denoising perfor- the on-board processing steps in digital camera pipelines may mance, outperforming comparison algorithms, in- also introduce different noise strength assigned to different cluding several methods based on neural networks. channels [Nam et al., 2016]. This reveals the difficulties and challenges faced by RGB color image denoising. Intuitive- 1 Introduction ly, if we apply the grayscale image denoising algorithm to color images in a band-wise manner without considering the Noise corruption is inevitable during the image acquisition mutual information and noise difference between each chan- process and may heavily degrade the visual quality of an ac- nel, artifacts or false colors could be generated [Mairal et al., quired image. Image denoising is thus an essential prepro- 2008]. An example of this is presented in Fig.1. Here, we cessing step in various image processing and computer vi- apply a representative low-rank-based method, WSNM [Xie [ sion tasks Chatterjee and Milanfar, 2010; Ye et al., 2018; et al., 2016], and our proposed method, MCWSNM (see Sec- ] Wang et al., 2018b ; moreover, it is also an ideal test platform tion 2), to the Kodak PhotoCD Dataset1. It can be seen from for evaluating image prior models and optimization methods the figure that WSNM retains a large amount of noise and, [ ] Roth and Black, 2005 . As a result, image denoising re- to some extent, introduces some artifacts. Thus, the issue of mains a challenging yet fundamental problem. Early denois- how noise differences in each channel should be modeled is ing algorithms were mainly devised on the basis of filter and key to designing a good RGB color denoising algorithm. transformation [Dabov et al., 2007b], such as wavelet trans- One well-known color image denoising method is color form and curvelet transform [Starck et al., 2002]. State-of- ∗Co-corresponding Author. 1http://r0k.us/graphics/kodak/ 637 Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20) V 1 b V g 1 r … (a) Clean (b) Noisy (c) WSNM (d) MCWSNM V 2 … 3V V ଶ … Figure 1: Denoised results of ”kodim07” in the Kodak PhotoCD Dataset with σr = 50, σg = 5 and σb = 20. WSNM performs in a band-wise manner; MCWSNM jointly processes three channels V M < jointly and considers their noise differences simultaneously. (It is V M best to zoom in on these images on-screen). Figure 2: Illusion of the grouped local patches. block-matching 3D (CBM3D) [Dabov et al., 2007a]. This 2 Our Proposed Method is a transformation field method that may destroy the noise 2.1 Method Formulation distribution across each channel and introduce artifacts. The The image denoising task involves recovering a clean image method in [Zhu et al., 2016] concatenates similar patches of xc from its noisy observation data yc, where c = fr; g; bg RGB channels into a long vector. However, this concatena- is the index of the R, G and B channels. Here, we assume tion operation regards each channel equally and ignores the that the noisy image is corrupted by additive white Gaussian noise level differences across each channel. For a long time, noise nc with σc = (σr; σg; σb), where σr; σg; σb denote the low-rank theory has been widely used in many fields, such noise standard deviation in the R, G, B channels, respectively. [ et al. ] as recommendation system Wang , 2018a , community Mathematically, y = x + n . Low-rank-based denoising [ et al. et al. ] c c c search Fang , 2020b; Fang , 2020a , multi-out task methods utilize the low-rank property of the grouped non- [ et al. ] [ et al. ] Liu , 2019 and multi-label Liu , 2017 , etc. local similarity patches; this group procedure is illustrated in y Recently, based on the non-local self-similarity of an image Fig.2. Given a noisy RGB color image c, each local patch of size s × s × 3 is extracted and stretched into a vector and SVD, several low-rank-based denoising algorithms have 2 2 y = (yT ; yT ; yT )T 2 R3s y ; y ; y 2 Rs been proposed, such as weighted nuclear norm minimization r g b , where r g b are (WNNM) [Gu et al., 2017] and WSNM [Xie et al., 2016]. the corresponding patches in the R, G and B channels. For y M The nuclear norm and Schatten p-norm [Xie et al., 2016] are each vectorized local patch , we find its most similar y two surrogates of the rank function [Xu et al., 2017a]. In patches (including itself) by Euclidean distance in a proper M WNNM and WSNM, singular values are assigned different size search window around it. We then stack the similar weights using a weight vector. It has been proven that WNN- patches vector column by column to obtain the noisy patch R3s2×M M is a special case of WSNM and that WSNM outperform- matrix Y = X + N 2 , where X and N are the s WNNM. Multi-channel WNNM (MCWNNM) [Xu et al., corresponding clean and noise patch matrix, respectively. It 2017b] is an extension of WNNM from grayscale image to is clear that Y is a low-rank matrix and X can be recovered color image that operates by introducing a weight matrix to using a low-rank approximation algorithm. adjust each channel’s contribution to the final results based As a nonconvex surrogate of the rank function, the weight- Rm×n on noise level. ed Schatten p-norm of a matrix Z 2 is defined as minfm;ng 1 P p p kZkw;Sp = ( i=1 wiσi ) ; 0 < p < 1, where w = T In this paper, based on the low-rank property of the non- (w1; w2; : : : ; wminfm;ng) is a non-negative weight vector local self-similar patches and the different noise strength and σi is the i-th singular value of Z. The weighted nuclear across each channel, we propose a multi-channel weighted norm [Gu et al., 2017] is a special case of the weighted Schat- Schatten p-norm minimization (MCWSNM) model for RG- ten p-norm when the power p = 1. It has been proven that the B color image denoising. More specifically, considering a Schatten p-norm with 0 < p < 1 is a better approximation of small local RGB patch in a noisy image, we first find its non- rank function than the nuclear norm [Zhang et al., 2013]. local similar cubic patches in a search window with an appro- Intuitively, when denoising a noisy color image, if a chan- priate size. These similar cubic patches are then vectorized nel is heavily polluted, the contribution of this channel to the and concatenated to construct a noisy low-rank matrix; then final denoised results should be less, and vice versa.
Recommended publications
  • Image Processing Terms Used in This Section Working with Different Screen Bit Describes How to Determine the Screen Bit Depth of Your Depths (P
    13 Color This section describes the toolbox functions that help you work with color image data. Note that “color” includes shades of gray; therefore much of the discussion in this chapter applies to grayscale images as well as color images. Topics covered include Terminology (p. 13-2) Provides definitions of image processing terms used in this section Working with Different Screen Bit Describes how to determine the screen bit depth of your Depths (p. 13-3) system and provides recommendations if you can change the bit depth Reducing the Number of Colors in an Describes how to use imapprox and rgb2ind to reduce the Image (p. 13-6) number of colors in an image, including information about dithering Converting to Other Color Spaces Defines the concept of image color space and describes (p. 13-15) how to convert images between color spaces 13 Color Terminology An understanding of the following terms will help you to use this chapter. Terms Definitions Approximation The method by which the software chooses replacement colors in the event that direct matches cannot be found. The methods of approximation discussed in this chapter are colormap mapping, uniform quantization, and minimum variance quantization. Indexed image An image whose pixel values are direct indices into an RGB colormap. In MATLAB, an indexed image is represented by an array of class uint8, uint16, or double. The colormap is always an m-by-3 array of class double. We often use the variable name X to represent an indexed image in memory, and map to represent the colormap. Intensity image An image consisting of intensity (grayscale) values.
    [Show full text]
  • Real-Time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location
    Missouri University of Science and Technology Scholars' Mine Electrical and Computer Engineering Faculty Research & Creative Works Electrical and Computer Engineering 01 Nov 2015 Real-Time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location Ravneet Kaur P. P. Albano Justin G. Cole Jason R. Hagerty et. al. For a complete list of authors, see https://scholarsmine.mst.edu/ele_comeng_facwork/3045 Follow this and additional works at: https://scholarsmine.mst.edu/ele_comeng_facwork Part of the Chemistry Commons, and the Electrical and Computer Engineering Commons Recommended Citation R. Kaur et al., "Real-Time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location," Skin Research and Technology, vol. 21, no. 4, pp. 466-473, John Wiley & Sons, Nov 2015. The definitive version is available at https://doi.org/10.1111/srt.12216 This Article - Journal is brought to you for free and open access by Scholars' Mine. It has been accepted for inclusion in Electrical and Computer Engineering Faculty Research & Creative Works by an authorized administrator of Scholars' Mine. This work is protected by U. S. Copyright Law. Unauthorized use including reproduction for redistribution requires the permission of the copyright holder. For more information, please contact [email protected]. Published in final edited form as: Skin Res Technol. 2015 November ; 21(4): 466–473. doi:10.1111/srt.12216. Real-time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location Ravneet Kaur, MS, Department of Electrical and Computer Engineering, Southern Illinois University Edwardsville, Campus Box 1801, Edwardsville, IL 62026-1801, Telephone: 618-210-6223, [email protected] Peter P.
    [Show full text]
  • Accurately Reproducing Pantone Colors on Digital Presses
    Accurately Reproducing Pantone Colors on Digital Presses By Anne Howard Graphic Communication Department College of Liberal Arts California Polytechnic State University June 2012 Abstract Anne Howard Graphic Communication Department, June 2012 Advisor: Dr. Xiaoying Rong The purpose of this study was to find out how accurately digital presses reproduce Pantone spot colors. The Pantone Matching System is a printing industry standard for spot colors. Because digital printing is becoming more popular, this study was intended to help designers decide on whether they should print Pantone colors on digital presses and expect to see similar colors on paper as they do on a computer monitor. This study investigated how a Xerox DocuColor 2060, Ricoh Pro C900s, and a Konica Minolta bizhub Press C8000 with default settings could print 45 Pantone colors from the Uncoated Solid color book with only the use of cyan, magenta, yellow and black toner. After creating a profile with a GRACoL target sheet, the 45 colors were printed again, measured and compared to the original Pantone Swatch book. Results from this study showed that the profile helped correct the DocuColor color output, however, the Konica Minolta and Ricoh color outputs generally produced the same as they did without the profile. The Konica Minolta and Ricoh have much newer versions of the EFI Fiery RIPs than the DocuColor so they are more likely to interpret Pantone colors the same way as when a profile is used. If printers are using newer presses, they should expect to see consistent color output of Pantone colors with or without profiles when using default settings.
    [Show full text]
  • Predictability of Spot Color Overprints
    Predictability of Spot Color Overprints Robert Chung, Michael Riordan, and Sri Prakhya Rochester Institute of Technology School of Print Media 69 Lomb Memorial Drive, Rochester, NY 14623, USA emails: [email protected], [email protected], [email protected] Keywords spot color, overprint, color management, portability, predictability Abstract Pre-media software packages, e.g., Adobe Illustrator, do amazing things. They give designers endless choices of how line, area, color, and transparency can interact with one another while providing the display that simulates printed results. Most prepress practitioners are thrilled with pre-media software when working with process colors. This research encountered a color management gap in pre-media software’s ability to predict spot color overprint accurately between display and print. In order to understand the problem, this paper (1) describes the concepts of color portability and color predictability in the context of color management, (2) describes an experimental set-up whereby display and print are viewed under bright viewing surround, (3) conducts display-to-print comparison of process color patches, (4) conducts display-to-print comparison of spot color solids, and, finally, (5) conducts display-to-print comparison of spot color overprints. In doing so, this research points out why the display-to-print match works for process colors, and fails for spot color overprints. Like Genie out of the bottle, there is no turning back nor quick fix to reconcile the problem with predictability of spot color overprints in pre-media software for some time to come. 1. Introduction Color portability is a key concept in ICC color management.
    [Show full text]
  • Color Images, Color Spaces and Color Image Processing
    color images, color spaces and color image processing Ole-Johan Skrede 08.03.2017 INF2310 - Digital Image Processing Department of Informatics The Faculty of Mathematics and Natural Sciences University of Oslo After original slides by Fritz Albregtsen today’s lecture ∙ Color, color vision and color detection ∙ Color spaces and color models ∙ Transitions between color spaces ∙ Color image display ∙ Look up tables for colors ∙ Color image printing ∙ Pseudocolors and fake colors ∙ Color image processing ∙ Sections in Gonzales & Woods: ∙ 6.1 Color Funcdamentals ∙ 6.2 Color Models ∙ 6.3 Pseudocolor Image Processing ∙ 6.4 Basics of Full-Color Image Processing ∙ 6.5.5 Histogram Processing ∙ 6.6 Smoothing and Sharpening ∙ 6.7 Image Segmentation Based on Color 1 motivation ∙ We can differentiate between thousands of colors ∙ Colors make it easy to distinguish objects ∙ Visually ∙ And digitally ∙ We need to: ∙ Know what color space to use for different tasks ∙ Transit between color spaces ∙ Store color images rationally and compactly ∙ Know techniques for color image printing 2 the color of the light from the sun spectral exitance The light from the sun can be modeled with the spectral exitance of a black surface (the radiant exitance of a surface per unit wavelength) 2πhc2 1 M(λ) = { } : λ5 hc − exp λkT 1 where ∙ h ≈ 6:626 070 04 × 10−34 m2 kg s−1 is the Planck constant. ∙ c = 299 792 458 m s−1 is the speed of light. ∙ λ [m] is the radiation wavelength. ∙ k ≈ 1:380 648 52 × 10−23 m2 kg s−2 K−1 is the Boltzmann constant. T ∙ [K] is the surface temperature of the radiating Figure 1: Spectral exitance of a black body surface for different body.
    [Show full text]
  • Arxiv:1902.00267V1 [Cs.CV] 1 Feb 2019 Fcmue Iin Oto H Aaesue O Mg Classificat Image Th for in Used Applications Datasets Fundamental the Most of the Most Vision
    ColorNet: Investigating the importance of color spaces for image classification⋆ Shreyank N Gowda1 and Chun Yuan2 1 Computer Science Department, Tsinghua University, Beijing 10084, China [email protected] 2 Graduate School at Shenzhen, Tsinghua University, Shenzhen 518055, China [email protected] Abstract. Image classification is a fundamental application in computer vision. Recently, deeper networks and highly connected networks have shown state of the art performance for image classification tasks. Most datasets these days consist of a finite number of color images. These color images are taken as input in the form of RGB images and clas- sification is done without modifying them. We explore the importance of color spaces and show that color spaces (essentially transformations of original RGB images) can significantly affect classification accuracy. Further, we show that certain classes of images are better represented in particular color spaces and for a dataset with a highly varying number of classes such as CIFAR and Imagenet, using a model that considers multi- ple color spaces within the same model gives excellent levels of accuracy. Also, we show that such a model, where the input is preprocessed into multiple color spaces simultaneously, needs far fewer parameters to ob- tain high accuracy for classification. For example, our model with 1.75M parameters significantly outperforms DenseNet 100-12 that has 12M pa- rameters and gives results comparable to Densenet-BC-190-40 that has 25.6M parameters for classification of four competitive image classifica- tion datasets namely: CIFAR-10, CIFAR-100, SVHN and Imagenet. Our model essentially takes an RGB image as input, simultaneously converts the image into 7 different color spaces and uses these as inputs to individ- ual densenets.
    [Show full text]
  • 24-Bit Color Images  in a Color 24-Bit Image, Each Pixel Is Represented by Three Bytes, Usually Representing RGB
    A:Color look-up table B: Dithering A:Color look-up table 1-bit Images Each pixel is stored as a single bit (0 or 1), so also referred to as binary image. Such an image is also called a 1-bit monochrome image since it contains no color. Fig. 3.1 shows a 1-bit monochrome image (called \Lena" by multimedia scientists | this is a standard image used to illustrate many algorithms). 8-bit Gray-level Images Each pixel has a gray-value between 0 and 255. Each pixel is represented by a single byte; e.g., a dark pixel might have a value of 10, and a bright one might be 230. Bitmap: The two-dimensional array of pixel values that represents the graphics/image data. Image resolution refers to the number of pixels in a digital image (higher resolution always yields better quality). Fairly high resolution for such an image might be 1600 X 1200, whereas lower resolution might be 640 X 480. 24-bit Color Images In a color 24-bit image, each pixel is represented by three bytes, usually representing RGB. This format supports 256 X 256 X 256 possible combined colors, or a total of 16,777,216 possible colors. However such flexibility does result in a storage penalty: A 640X480 24-bit color image would require 921.6 KB of storage without any compression. An important point: many 24-bit color images are actually stored as 32-bit images, with the extra byte of data for each pixel used to store an alpha value representing special effect information (e.g., transparency).
    [Show full text]
  • Robust Analysis of Feature Spaces: Color Image Segmentation
    Robust Analysis of Feature Spaces: Color Image Segmentation Dorin Comaniciu Peter Meer Department of Electrical and Computer Engineering Rutgers University, Piscataway, NJ 08855, USA Keywords: robust pattern analysis, low-level vision, content-based indexing Abstract mo des can b e very large, of the order of tens. The highest density regions corresp ond to clusters A general technique for the recovery of signi cant centered on the mo des of the underlying probability image features is presented. The technique is basedon distribution. Traditional clustering techniques [6], can the mean shift algorithm, a simple nonparametric pro- b e used for feature space analysis but they are reliable cedure for estimating density gradients. Drawbacks of only if the number of clusters is small and known a the current methods (including robust clustering) are priori. Estimating the number of clusters from the avoided. Featurespace of any naturecan beprocessed, data is computationally exp ensive and not guaranteed and as an example, color image segmentation is dis- to pro duce satisfactory result. cussed. The segmentation is completely autonomous, Amuch to o often used assumption is that the indi- only its class is chosen by the user. Thus, the same vidual clusters ob ey multivariate normal distributions, program can produce a high quality edge image, or pro- i.e., the feature space can b e mo deled as a mixture of vide, by extracting al l the signi cant colors, a prepro- Gaussians. The parameters of the mixture are then cessor for content-based query systems. A 512 512 estimated by minimizing an error criterion. For exam- color image is analyzed in less than 10 seconds on a ple, a large class of thresholding algorithms are based standard workstation.
    [Show full text]
  • Developing a New Color Model for Image Analysis and Processing
    UDC 004.421 Developing a New Color Model for Image Analysis and Processing Rashad J. Rasras1 , Ibrahiem M. M. El Emary2, Dmitriy E. Skopin1 1 Faculty of Engineering Technology, Amman, Al Balqa Applied University Jordan rashad_ras@yahoo. 2 Faculty of Engineering, Al Ahliyya Amman University, Amman, Jordan com [email protected] Abstract. The theoretical outcomes and experimental results of new color model implemented in algorithms and software of image processing are presented in the paper. This model, as it will be shown below, may be used in modern real time video processing applications such as radar tracking and communication systems. The developed model allows carrying out the image process with the least time delays (i.e. it speeding up image processing). The proposed model can be used to solve the problem of true color object identification. Experimental results show that the time spent during RGI color model conversion may approximately four times less than the time spent during other similar models. 1. Introduction Digital image processing is a new and promptly developing field which finds more and more application in various information and technical systems such as: radar-tracking, communications, television, astronomy, etc. [1]. There are numerous methods of digital image processing techniques such as: histogram processing, local enhancement, smoothing and sharpening, color segmentation, a digital image filtration and edge detection. Initially, theses methods were designed specially for gray scale image processing [2, 3]. The RGB color model is standard design of computer graphics systems is not ideal for all of its applications. The red, green, and blue color components are highly correlated.
    [Show full text]
  • Interactive Color Image Segmentation Using HSV Color Space
    Science and Technology Journal, Vol. 7 Issue: 1 ISSN: 2321-3388 Interactive Color Image Segmentation using HSV Color Space D. Hema1 and Dr. S. Kannan2 1Assistant Professor, Department of Computer Science, Lady Doak College, Madurai 2Professor, Department of Computer Applications, School of Information Technology, Madurai Kamaraj University, Madurai E-mail: [email protected], [email protected] Abstract—The primary goal of this research work is to extract only the essential foreground fragments of a color image through segmentation. This technique serves as the foundation for implementing object detection algorithms. The color image can be segmented better in HSV color space model than other color models. An interactive GUI tool is developed in Python and implemented to extract only the foreground from an image by adjusting the values for H (Hue), S (Saturation) and V (Value). The input is an RGB image and the output will be a segmented color image. Keywords: Color Spaces, CMYK, HSV, Segmentation INTRODUCTION vision is another branch of research where Digital Image processing plays a vital role to learn, make inferences and perform actions based on visual inputs. Human eyes can perceive and understand the objects in an image. In order to design a machine that can understand like a human, it requires appropriate algorithm and massive training. An image is a visual representation of any real A color model is an abstract mathematical model in world object that can be manipulated computationally. which colors are represented as tuples of numbers either Image Processing is an extensive area of research that gains three or four values or color components.
    [Show full text]
  • Decolorize: Fast, Contrast Enhancing, Color to Grayscale Conversion
    Decolorize: fast, contrast enhancing, color to grayscale conversion Mark Grundland*, Neil A. Dodgson Computer Laboratory, University of Cambridge Abstract We present a new contrast enhancing color to grayscale conversion algorithm which works in real-time. It incorporates novel techniques for image sampling and dimensionality reduction, sampling color differences by Gaussian pairing and analyzing color differences by predominant component analysis. In addition to its speed and simplicity, the algorithm has the advantages of continuous mapping, global consistency, and grayscale preservation, as well as predictable luminance, saturation, and hue ordering properties. Keywords: Color image processing; Color to grayscale conversion; Contrast enhancement; Image sampling; Dimensionality reduction. 1 Introduction We present a new contrast enhancing color to grayscale conversion algorithm which works in real-time. Compared with standard color to grayscale conversion, our approach better preserves equiluminant chromatic contrasts in grayscale. Alternatively, compared with standard principal component color projection, our approach better preserves achromatic luminance contrasts in grayscale. Finally, compared with recent optimization methods [1,2] for contrast enhancing color to grayscale conversion, this approach proves much more computationally efficient at producing visually similar results with stronger mathematical guarantees on its performance. Many legacy pattern recognition systems and algorithms only operate on the luminance channel and entirely ignore chromatic information. By ignoring important chromatic features, such as edges between isoluminant regions, they risk reduced recognition rates. Although luminance predominates human visual processing, on its own it may not capture the perceived color contrasts. On the other hand, maximizing variation by principal component analysis can result in high contrasts, but it may not prove visually meaningful.
    [Show full text]
  • A Perception-Based Color Space for Illumination-Invariant Image Processing
    A Perception-based Color Space for Illumination-invariant Image Processing Hamilton Y. Chong∗ Steven J. Gortler† Todd Zickler‡ Harvard University Harvard University Harvard University (a) Source image 1. (b) Source image 2. (c) Foreground (red) and (d) Initial foreground and (e) Source 1 segmentation: background (blue) pixels. background mask. GammaRGB. (f) Source 2 segmentation: (g) Source 1 segmentation: (h) Source 2 segmentation: (i) Source 1 segmentation: (j) Source 2 segmentation: GammaRGB. Lab. Lab. Ours. Ours. Figure 1: Segmentations of the same flower scene under two different illuminants. For (nonlinear) RGB and Lab color spaces, segmentation parameters for the two images were tweaked individually (optimal parameters for the image pairs were separated by 2 and 5 orders of magnitude respectively); for our color space, the same parameters were used for segmenting both images. Abstract required by a large class of applications that includes segmenta- tion and Poisson image editing [Perez´ et al. 2003]. In this work, Motivated by perceptual principles, we derive a new color space in we present a new color space designed specifically to address this which the associated metric approximates perceived distances and deficiency. color displacements capture relationships that are robust to spectral changes in illumination. The resulting color space can be used with We propose the following two color space desiderata for image pro- existing image processing algorithms with little or no change to the cessing: methods. (1) difference vectors between color pixels are unchanged by re- illumination; CR Categories: I.3.m [Computer Graphics]: Misc.—Perception Keywords: perception, image processing, color space (2) the `2 norm of a difference vector matches the perceptual dis- tance between the two colors.
    [Show full text]