Robust Analysis of Feature Spaces: Color Image Segmentation

Total Page:16

File Type:pdf, Size:1020Kb

Robust Analysis of Feature Spaces: Color Image Segmentation Robust Analysis of Feature Spaces: Color Image Segmentation Dorin Comaniciu Peter Meer Department of Electrical and Computer Engineering Rutgers University, Piscataway, NJ 08855, USA Keywords: robust pattern analysis, low-level vision, content-based indexing Abstract mo des can b e very large, of the order of tens. The highest density regions corresp ond to clusters A general technique for the recovery of signi cant centered on the mo des of the underlying probability image features is presented. The technique is basedon distribution. Traditional clustering techniques [6], can the mean shift algorithm, a simple nonparametric pro- b e used for feature space analysis but they are reliable cedure for estimating density gradients. Drawbacks of only if the number of clusters is small and known a the current methods (including robust clustering) are priori. Estimating the number of clusters from the avoided. Featurespace of any naturecan beprocessed, data is computationally exp ensive and not guaranteed and as an example, color image segmentation is dis- to pro duce satisfactory result. cussed. The segmentation is completely autonomous, Amuch to o often used assumption is that the indi- only its class is chosen by the user. Thus, the same vidual clusters ob ey multivariate normal distributions, program can produce a high quality edge image, or pro- i.e., the feature space can b e mo deled as a mixture of vide, by extracting al l the signi cant colors, a prepro- Gaussians. The parameters of the mixture are then cessor for content-based query systems. A 512 512 estimated by minimizing an error criterion. For exam- color image is analyzed in less than 10 seconds on a ple, a large class of thresholding algorithms are based standard workstation. Gray level images are hand led on the Gaussian mixture mo del of the histogram, e.g. as color images having only the lightness coordinate. [11]. However, there is no theoretical evidence that an 1 Intro duction extracted normal cluster necessarily corresp onds to a Feature space analysis is a widely used to ol for solv- signi cant image feature. On the contrary, a strong ing low-level image understanding tasks. Given an im- artifact cluster may app ear when several features are age, feature vectors are extracted from lo cal neighb or- mapp ed into partially overlapping regions. hoods and mapp ed into the space spanned by their Nonparametric density estimation [4, Chap. 6] comp onents. Signi cant features in the image then avoids the use of the normality assumption. The two corresp ond to high density regions in this space. Fea- families of metho ds, Parzen window, and k-nearest ture space analysis is the pro cedure of recovering the neighb ors, b oth require additional input information centers of the high density regions, i.e., the represen- (typ e of the kernel, numb er of neighb ors). This infor- tations of the signi cant image features. Histogram mation must b e provided by the user, and for multi- based techniques, Hough transform are examples of mo dal distributions it is dicult to guess the optimal the approach. setting. When the number of distinct feature vectors is Nevertheless, a reliable general technique for fea- large, the size of the feature space is reduced by group- ture space analysis can be develop ed using a simple ing nearbyvectors into a single cell. A discretized fea- nonparametric density estimation algorithm. In this ture space is called an accumulator. Whenever the size pap er we prop ose such a technique whose robust b e- of the accumulator cell is not adequate for the data, havior is sup erior to metho ds employing robust esti- serious artifacts can app ear. The problem was exten- mators from statistics. sively studied in the context of the Hough transform, 2 Requirements for Robustness e.g. [5]. Thus, for satisfactory results a featurespace should have continuous coordinate system. The con- Estimation of a cluster center is called in statistics tent of a continuous feature space can b e mo deled as the multivariate lo cation problem. To be robust, an a sample from a multivariate, multimo dal probability estimator must tolerate a p ercentage of outliers, i.e., distribution. Note that for real images the number of data p oints not ob eying the underlying distribution 1 of the cluster. Numerous robust techniques were pro- image domain. That is, the feature vectors satisfy ad- posed [10, Sec. 7.1], and in computer vision the most ditional, spatial constraints. While these constraints widely used is the minimum volume el lipsoid (MVE) are indeed used in the currenttechniques, their role is estimator prop osed by Rousseeuw [10,p. 258]. mostly limited to comp ensating for feature allo cation errors made during the independent analysis of the The MVE estimator is ane equivariant(anane feature space. To b e robust the featurespace analysis transformation of the input is passed on to the es- must ful ly exploit the image domain information. timate) and has high breakdown point (tolerates up As a consequence of the increased role of image to half the data b eing outliers). The estimator nds domain information the burden on the feature space the center of the highest density region by searching analysis can b e reduced. First al l the signi cant fea- for the minimal volume ellipsoid containing at least tures are extracted, and only after then are the clusters h data p oints. The multivariate lo cation estimate is containing the instances of these features recovered. the center of this ellipsoid. To avoid combinatorial The latter pro cedure uses image domain information explosion a probabilistic search is employed. Let the and avoids the normality assumption. dimension of the data b e p. A small number of (p + 1)- tuple of p oints are randomly chosen. For each(p + 1)- Signi cant features corresp ond to high densityre- tuple the mean vector and covariance matrix are com- gions and to lo cate these regions a search windowmust puted, de ning an ellipsoid. The ellipsoid is in ated be employed. The number of parameters de ning to include h points, and the one having the minimum the shap e and size of the window should b e minimal, volume provides the MVE estimate. and therefore whenever it is p ossible the featurespace should be isotropic. A space is isotropic if the distance Based on MVE, a robust clustering technique with between two p oints is indep endent on the lo cation of applications in computer vision was prop osed in [7]. the p oint pair. The most widely used isotropic space is The data is analyzed under several \resolutions" by the Euclidean space, where a sphere, having only one applying the MVE estimator rep eatedly with h values parameter (its radius) can b e employed as searchwin- representing xed p ercentages of the data p oints. The dow. The isotropy requirement determines the map- b est cluster then corresp onds to the h value yielding ping from the image domain to the feature space. If the highest density inside the minimum volume ellip- the isotropy condition cannot be satis ed, a Maha- soid. The cluster is removed from the feature space, lanobis metric should be de ned from the statement and the whole pro cedure is rep eated till the space is of the task. not empty. The robustness of MVE should ensure that each cluster is asso ciated with only one mo de of We conclude that robust feature space analysis re- the underlying distribution. The numb er of signi cant quires a reliable pro cedure for the detection of high clusters is not needed a priori. density regions. Such a pro cedure is presented in the next section. The robust clustering metho d was successfully em- ployed for the analysis of a large variety of feature spaces, but was found to b ecome less reliable once 3 Mean Shift Algorithm the number of mo des exceeded ten. This is mainly A simple, nonparametric technique for estimation due to the normality assumption emb edded into the of the density gradientwas prop osed in 1975 byFuku- metho d. The ellipsoid de ning a cluster can be also naga and Hostetler [4, p. 534]. The idea was recently viewed as the high con dence region of a multivari- generalized by Cheng [2]. ate normal distribution. Arbitrary feature spaces are Assume, for the moment, that the probability den- not mixtures of Gaussians and constraining the shap e sity function p(x ) of the p-dimensional feature vectors of the removed clusters to b e elliptical can intro duce x is unimo dal. This condition is for sake of clarity serious artifacts. The e ect of these artifacts propa- only, later will be removed. A sphere S of radius x gates as more and more clusters are removed. Fur- r , centered on x contains the feature vectors y such thermore, the estimated covariance matrices are not that k y x k r . The exp ected value of the vector reliable since are based on only p +1 points. Subse- z = y x,given x and S is x quent p ostpro cessing based on all the p oints declared inliers cannot fully comp ensate for an initial error. Z To be able to correctly recover a large number of = E [ zjS ] = (y x)p(y jS )dy (1) x x S signi cant features, the problem of feature space anal- x Z p(y ) ysis must b e solved in context. In image understand- dy (y x) = ing tasks the data to be analyzed originates in the p(y 2S ) x S x 2 If S is suciently small we can approximate lo cation of the mo de detected by the one-dimensional x MVE mo de detector, i.e., the center of the shortest p p(y 2S )=p(x )V where V = c r (2) x rectangular window containing half the data points S S x x [10, Sec.
Recommended publications
  • Image Processing Terms Used in This Section Working with Different Screen Bit Describes How to Determine the Screen Bit Depth of Your Depths (P
    13 Color This section describes the toolbox functions that help you work with color image data. Note that “color” includes shades of gray; therefore much of the discussion in this chapter applies to grayscale images as well as color images. Topics covered include Terminology (p. 13-2) Provides definitions of image processing terms used in this section Working with Different Screen Bit Describes how to determine the screen bit depth of your Depths (p. 13-3) system and provides recommendations if you can change the bit depth Reducing the Number of Colors in an Describes how to use imapprox and rgb2ind to reduce the Image (p. 13-6) number of colors in an image, including information about dithering Converting to Other Color Spaces Defines the concept of image color space and describes (p. 13-15) how to convert images between color spaces 13 Color Terminology An understanding of the following terms will help you to use this chapter. Terms Definitions Approximation The method by which the software chooses replacement colors in the event that direct matches cannot be found. The methods of approximation discussed in this chapter are colormap mapping, uniform quantization, and minimum variance quantization. Indexed image An image whose pixel values are direct indices into an RGB colormap. In MATLAB, an indexed image is represented by an array of class uint8, uint16, or double. The colormap is always an m-by-3 array of class double. We often use the variable name X to represent an indexed image in memory, and map to represent the colormap. Intensity image An image consisting of intensity (grayscale) values.
    [Show full text]
  • Real-Time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location
    Missouri University of Science and Technology Scholars' Mine Electrical and Computer Engineering Faculty Research & Creative Works Electrical and Computer Engineering 01 Nov 2015 Real-Time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location Ravneet Kaur P. P. Albano Justin G. Cole Jason R. Hagerty et. al. For a complete list of authors, see https://scholarsmine.mst.edu/ele_comeng_facwork/3045 Follow this and additional works at: https://scholarsmine.mst.edu/ele_comeng_facwork Part of the Chemistry Commons, and the Electrical and Computer Engineering Commons Recommended Citation R. Kaur et al., "Real-Time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location," Skin Research and Technology, vol. 21, no. 4, pp. 466-473, John Wiley & Sons, Nov 2015. The definitive version is available at https://doi.org/10.1111/srt.12216 This Article - Journal is brought to you for free and open access by Scholars' Mine. It has been accepted for inclusion in Electrical and Computer Engineering Faculty Research & Creative Works by an authorized administrator of Scholars' Mine. This work is protected by U. S. Copyright Law. Unauthorized use including reproduction for redistribution requires the permission of the copyright holder. For more information, please contact [email protected]. Published in final edited form as: Skin Res Technol. 2015 November ; 21(4): 466–473. doi:10.1111/srt.12216. Real-time Supervised Detection of Pink Areas in Dermoscopic Images of Melanoma: Importance of Color Shades, Texture and Location Ravneet Kaur, MS, Department of Electrical and Computer Engineering, Southern Illinois University Edwardsville, Campus Box 1801, Edwardsville, IL 62026-1801, Telephone: 618-210-6223, [email protected] Peter P.
    [Show full text]
  • Accurately Reproducing Pantone Colors on Digital Presses
    Accurately Reproducing Pantone Colors on Digital Presses By Anne Howard Graphic Communication Department College of Liberal Arts California Polytechnic State University June 2012 Abstract Anne Howard Graphic Communication Department, June 2012 Advisor: Dr. Xiaoying Rong The purpose of this study was to find out how accurately digital presses reproduce Pantone spot colors. The Pantone Matching System is a printing industry standard for spot colors. Because digital printing is becoming more popular, this study was intended to help designers decide on whether they should print Pantone colors on digital presses and expect to see similar colors on paper as they do on a computer monitor. This study investigated how a Xerox DocuColor 2060, Ricoh Pro C900s, and a Konica Minolta bizhub Press C8000 with default settings could print 45 Pantone colors from the Uncoated Solid color book with only the use of cyan, magenta, yellow and black toner. After creating a profile with a GRACoL target sheet, the 45 colors were printed again, measured and compared to the original Pantone Swatch book. Results from this study showed that the profile helped correct the DocuColor color output, however, the Konica Minolta and Ricoh color outputs generally produced the same as they did without the profile. The Konica Minolta and Ricoh have much newer versions of the EFI Fiery RIPs than the DocuColor so they are more likely to interpret Pantone colors the same way as when a profile is used. If printers are using newer presses, they should expect to see consistent color output of Pantone colors with or without profiles when using default settings.
    [Show full text]
  • Predictability of Spot Color Overprints
    Predictability of Spot Color Overprints Robert Chung, Michael Riordan, and Sri Prakhya Rochester Institute of Technology School of Print Media 69 Lomb Memorial Drive, Rochester, NY 14623, USA emails: [email protected], [email protected], [email protected] Keywords spot color, overprint, color management, portability, predictability Abstract Pre-media software packages, e.g., Adobe Illustrator, do amazing things. They give designers endless choices of how line, area, color, and transparency can interact with one another while providing the display that simulates printed results. Most prepress practitioners are thrilled with pre-media software when working with process colors. This research encountered a color management gap in pre-media software’s ability to predict spot color overprint accurately between display and print. In order to understand the problem, this paper (1) describes the concepts of color portability and color predictability in the context of color management, (2) describes an experimental set-up whereby display and print are viewed under bright viewing surround, (3) conducts display-to-print comparison of process color patches, (4) conducts display-to-print comparison of spot color solids, and, finally, (5) conducts display-to-print comparison of spot color overprints. In doing so, this research points out why the display-to-print match works for process colors, and fails for spot color overprints. Like Genie out of the bottle, there is no turning back nor quick fix to reconcile the problem with predictability of spot color overprints in pre-media software for some time to come. 1. Introduction Color portability is a key concept in ICC color management.
    [Show full text]
  • Color Images, Color Spaces and Color Image Processing
    color images, color spaces and color image processing Ole-Johan Skrede 08.03.2017 INF2310 - Digital Image Processing Department of Informatics The Faculty of Mathematics and Natural Sciences University of Oslo After original slides by Fritz Albregtsen today’s lecture ∙ Color, color vision and color detection ∙ Color spaces and color models ∙ Transitions between color spaces ∙ Color image display ∙ Look up tables for colors ∙ Color image printing ∙ Pseudocolors and fake colors ∙ Color image processing ∙ Sections in Gonzales & Woods: ∙ 6.1 Color Funcdamentals ∙ 6.2 Color Models ∙ 6.3 Pseudocolor Image Processing ∙ 6.4 Basics of Full-Color Image Processing ∙ 6.5.5 Histogram Processing ∙ 6.6 Smoothing and Sharpening ∙ 6.7 Image Segmentation Based on Color 1 motivation ∙ We can differentiate between thousands of colors ∙ Colors make it easy to distinguish objects ∙ Visually ∙ And digitally ∙ We need to: ∙ Know what color space to use for different tasks ∙ Transit between color spaces ∙ Store color images rationally and compactly ∙ Know techniques for color image printing 2 the color of the light from the sun spectral exitance The light from the sun can be modeled with the spectral exitance of a black surface (the radiant exitance of a surface per unit wavelength) 2πhc2 1 M(λ) = { } : λ5 hc − exp λkT 1 where ∙ h ≈ 6:626 070 04 × 10−34 m2 kg s−1 is the Planck constant. ∙ c = 299 792 458 m s−1 is the speed of light. ∙ λ [m] is the radiation wavelength. ∙ k ≈ 1:380 648 52 × 10−23 m2 kg s−2 K−1 is the Boltzmann constant. T ∙ [K] is the surface temperature of the radiating Figure 1: Spectral exitance of a black body surface for different body.
    [Show full text]
  • Arxiv:1902.00267V1 [Cs.CV] 1 Feb 2019 Fcmue Iin Oto H Aaesue O Mg Classificat Image Th for in Used Applications Datasets Fundamental the Most of the Most Vision
    ColorNet: Investigating the importance of color spaces for image classification⋆ Shreyank N Gowda1 and Chun Yuan2 1 Computer Science Department, Tsinghua University, Beijing 10084, China [email protected] 2 Graduate School at Shenzhen, Tsinghua University, Shenzhen 518055, China [email protected] Abstract. Image classification is a fundamental application in computer vision. Recently, deeper networks and highly connected networks have shown state of the art performance for image classification tasks. Most datasets these days consist of a finite number of color images. These color images are taken as input in the form of RGB images and clas- sification is done without modifying them. We explore the importance of color spaces and show that color spaces (essentially transformations of original RGB images) can significantly affect classification accuracy. Further, we show that certain classes of images are better represented in particular color spaces and for a dataset with a highly varying number of classes such as CIFAR and Imagenet, using a model that considers multi- ple color spaces within the same model gives excellent levels of accuracy. Also, we show that such a model, where the input is preprocessed into multiple color spaces simultaneously, needs far fewer parameters to ob- tain high accuracy for classification. For example, our model with 1.75M parameters significantly outperforms DenseNet 100-12 that has 12M pa- rameters and gives results comparable to Densenet-BC-190-40 that has 25.6M parameters for classification of four competitive image classifica- tion datasets namely: CIFAR-10, CIFAR-100, SVHN and Imagenet. Our model essentially takes an RGB image as input, simultaneously converts the image into 7 different color spaces and uses these as inputs to individ- ual densenets.
    [Show full text]
  • Multichannel Color Image Denoising Via Weighted Schatten P-Norm Minimization
    Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20) Multichannel Color Image Denoising via Weighted Schatten p-norm Minimization Xinjian Huang , Bo Du∗ and Weiwei Liu∗ School of Computer Science, Institute of Artificial Intelligence and National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China h [email protected], [email protected], [email protected] Abstract the-art denoising methods are mainly based on sparse repre- sentation [Dabov et al., 2007b], low-rank approximation [Gu The R, G and B channels of a color image gen- et al., 2017; Xie et al., 2016], dictionary learning [Zhang and erally have different noise statistical properties or Aeron, 2016; Marsousi et al., 2014; Mairal et al., 2012], non- noise strengths. It is thus problematic to apply local self-similarity [Buades et al., 2005; Dong et al., 2013a; grayscale image denoising algorithms to color im- Hu et al., 2019] and neural networks [Liu et al., 2018; age denoising. In this paper, based on the non-local Zhang et al., 2017a; Zhang et al., 2017b; Zhang et al., 2018]. self-similarity of an image and the different noise Current methods for RGB color image denoising can be strength across each channel, we propose a Multi- categorized into three classes. The first kind involves apply- Channel Weighted Schatten p-Norm Minimization ing grayscale image denoising algorithms to each channel in (MCWSNM) model for RGB color image denois- a channel-wise manner. However, these methods ignore the ing. More specifically, considering a small local correlations between R, G and B channels, meaning that un- RGB patch in a noisy image, we first find its nonlo- satisfactory results may be obtained.
    [Show full text]
  • 24-Bit Color Images  in a Color 24-Bit Image, Each Pixel Is Represented by Three Bytes, Usually Representing RGB
    A:Color look-up table B: Dithering A:Color look-up table 1-bit Images Each pixel is stored as a single bit (0 or 1), so also referred to as binary image. Such an image is also called a 1-bit monochrome image since it contains no color. Fig. 3.1 shows a 1-bit monochrome image (called \Lena" by multimedia scientists | this is a standard image used to illustrate many algorithms). 8-bit Gray-level Images Each pixel has a gray-value between 0 and 255. Each pixel is represented by a single byte; e.g., a dark pixel might have a value of 10, and a bright one might be 230. Bitmap: The two-dimensional array of pixel values that represents the graphics/image data. Image resolution refers to the number of pixels in a digital image (higher resolution always yields better quality). Fairly high resolution for such an image might be 1600 X 1200, whereas lower resolution might be 640 X 480. 24-bit Color Images In a color 24-bit image, each pixel is represented by three bytes, usually representing RGB. This format supports 256 X 256 X 256 possible combined colors, or a total of 16,777,216 possible colors. However such flexibility does result in a storage penalty: A 640X480 24-bit color image would require 921.6 KB of storage without any compression. An important point: many 24-bit color images are actually stored as 32-bit images, with the extra byte of data for each pixel used to store an alpha value representing special effect information (e.g., transparency).
    [Show full text]
  • Developing a New Color Model for Image Analysis and Processing
    UDC 004.421 Developing a New Color Model for Image Analysis and Processing Rashad J. Rasras1 , Ibrahiem M. M. El Emary2, Dmitriy E. Skopin1 1 Faculty of Engineering Technology, Amman, Al Balqa Applied University Jordan rashad_ras@yahoo. 2 Faculty of Engineering, Al Ahliyya Amman University, Amman, Jordan com [email protected] Abstract. The theoretical outcomes and experimental results of new color model implemented in algorithms and software of image processing are presented in the paper. This model, as it will be shown below, may be used in modern real time video processing applications such as radar tracking and communication systems. The developed model allows carrying out the image process with the least time delays (i.e. it speeding up image processing). The proposed model can be used to solve the problem of true color object identification. Experimental results show that the time spent during RGI color model conversion may approximately four times less than the time spent during other similar models. 1. Introduction Digital image processing is a new and promptly developing field which finds more and more application in various information and technical systems such as: radar-tracking, communications, television, astronomy, etc. [1]. There are numerous methods of digital image processing techniques such as: histogram processing, local enhancement, smoothing and sharpening, color segmentation, a digital image filtration and edge detection. Initially, theses methods were designed specially for gray scale image processing [2, 3]. The RGB color model is standard design of computer graphics systems is not ideal for all of its applications. The red, green, and blue color components are highly correlated.
    [Show full text]
  • Interactive Color Image Segmentation Using HSV Color Space
    Science and Technology Journal, Vol. 7 Issue: 1 ISSN: 2321-3388 Interactive Color Image Segmentation using HSV Color Space D. Hema1 and Dr. S. Kannan2 1Assistant Professor, Department of Computer Science, Lady Doak College, Madurai 2Professor, Department of Computer Applications, School of Information Technology, Madurai Kamaraj University, Madurai E-mail: [email protected], [email protected] Abstract—The primary goal of this research work is to extract only the essential foreground fragments of a color image through segmentation. This technique serves as the foundation for implementing object detection algorithms. The color image can be segmented better in HSV color space model than other color models. An interactive GUI tool is developed in Python and implemented to extract only the foreground from an image by adjusting the values for H (Hue), S (Saturation) and V (Value). The input is an RGB image and the output will be a segmented color image. Keywords: Color Spaces, CMYK, HSV, Segmentation INTRODUCTION vision is another branch of research where Digital Image processing plays a vital role to learn, make inferences and perform actions based on visual inputs. Human eyes can perceive and understand the objects in an image. In order to design a machine that can understand like a human, it requires appropriate algorithm and massive training. An image is a visual representation of any real A color model is an abstract mathematical model in world object that can be manipulated computationally. which colors are represented as tuples of numbers either Image Processing is an extensive area of research that gains three or four values or color components.
    [Show full text]
  • Decolorize: Fast, Contrast Enhancing, Color to Grayscale Conversion
    Decolorize: fast, contrast enhancing, color to grayscale conversion Mark Grundland*, Neil A. Dodgson Computer Laboratory, University of Cambridge Abstract We present a new contrast enhancing color to grayscale conversion algorithm which works in real-time. It incorporates novel techniques for image sampling and dimensionality reduction, sampling color differences by Gaussian pairing and analyzing color differences by predominant component analysis. In addition to its speed and simplicity, the algorithm has the advantages of continuous mapping, global consistency, and grayscale preservation, as well as predictable luminance, saturation, and hue ordering properties. Keywords: Color image processing; Color to grayscale conversion; Contrast enhancement; Image sampling; Dimensionality reduction. 1 Introduction We present a new contrast enhancing color to grayscale conversion algorithm which works in real-time. Compared with standard color to grayscale conversion, our approach better preserves equiluminant chromatic contrasts in grayscale. Alternatively, compared with standard principal component color projection, our approach better preserves achromatic luminance contrasts in grayscale. Finally, compared with recent optimization methods [1,2] for contrast enhancing color to grayscale conversion, this approach proves much more computationally efficient at producing visually similar results with stronger mathematical guarantees on its performance. Many legacy pattern recognition systems and algorithms only operate on the luminance channel and entirely ignore chromatic information. By ignoring important chromatic features, such as edges between isoluminant regions, they risk reduced recognition rates. Although luminance predominates human visual processing, on its own it may not capture the perceived color contrasts. On the other hand, maximizing variation by principal component analysis can result in high contrasts, but it may not prove visually meaningful.
    [Show full text]
  • A Perception-Based Color Space for Illumination-Invariant Image Processing
    A Perception-based Color Space for Illumination-invariant Image Processing Hamilton Y. Chong∗ Steven J. Gortler† Todd Zickler‡ Harvard University Harvard University Harvard University (a) Source image 1. (b) Source image 2. (c) Foreground (red) and (d) Initial foreground and (e) Source 1 segmentation: background (blue) pixels. background mask. GammaRGB. (f) Source 2 segmentation: (g) Source 1 segmentation: (h) Source 2 segmentation: (i) Source 1 segmentation: (j) Source 2 segmentation: GammaRGB. Lab. Lab. Ours. Ours. Figure 1: Segmentations of the same flower scene under two different illuminants. For (nonlinear) RGB and Lab color spaces, segmentation parameters for the two images were tweaked individually (optimal parameters for the image pairs were separated by 2 and 5 orders of magnitude respectively); for our color space, the same parameters were used for segmenting both images. Abstract required by a large class of applications that includes segmenta- tion and Poisson image editing [Perez´ et al. 2003]. In this work, Motivated by perceptual principles, we derive a new color space in we present a new color space designed specifically to address this which the associated metric approximates perceived distances and deficiency. color displacements capture relationships that are robust to spectral changes in illumination. The resulting color space can be used with We propose the following two color space desiderata for image pro- existing image processing algorithms with little or no change to the cessing: methods. (1) difference vectors between color pixels are unchanged by re- illumination; CR Categories: I.3.m [Computer Graphics]: Misc.—Perception Keywords: perception, image processing, color space (2) the `2 norm of a difference vector matches the perceptual dis- tance between the two colors.
    [Show full text]