Fast Color Quantization Using Macqueen's K-Means Algorithm

Total Page:16

File Type:pdf, Size:1020Kb

Fast Color Quantization Using Macqueen's K-Means Algorithm Journal of Real-Time Image Processing (2020) 17:1609–1624 https://doi.org/10.1007/s11554-019-00914-6 ORIGINAL RESEARCH PAPER Fast color quantization using MacQueen’s k-means algorithm Skyler Thompson1 · M. Emre Celebi1 · Krizia H. Buck1 Received: 22 May 2019 / Accepted: 6 September 2019 / Published online: 16 October 2019 © Springer-Verlag GmbH Germany, part of Springer Nature 2019 Abstract Color quantization (CQ) is an important operation with many applications in computer graphics and image processing and analysis. Clustering algorithms have been extensively applied to this problem. However, despite its popularity as a general purpose clustering algorithm, k-means has not received much attention in the CQ literature because of its high computational requirements and sensitivity to initialization. In this paper, we propose a novel CQ method based on an online k-means for- mulation due to MacQueen. The proposed method utilizes adaptive and efcient cluster center initialization and quasirandom sampling to attain deterministic, high speed, and high-quality quantization. Experiments on a diverse set of publicly available images demonstrate that the proposed method is signifcantly faster than the more common batch k-means formulation due to Lloyd while delivering nearly identical results. Keywords Color quantization · Clustering · MacQueen k-means · Lloyd k-means 1 Introduction Popular preclustering methods include median-cut [27], octree [22], variance-based method [59], binary splitting Given an input image, CQ involves the reduction of the [40], greedy orthogonal bipartitioning [61], center-cut [33], number of distinct colors ( N ) in the image to K ( ≪ N ) RWM-cut [66], and the more recent methods proposed by with minimum possible distortion. CQ is a challenging prob- Celebi et al. [8] and Ueda et al. [56]. On the other hand, lem as most real-world images contain tens of thousands postclustering algorithms adapted to CQ include maximin of colors. Recent applications of CQ include compression, [63], k-means [9, 10, 29, 30, 57], k-harmonic means [21], segmentation, text localization/detection, color analysis, competitive learning [11, 12], fuzzy c-means [47, 60], rough watermarking, non-photorealistic rendering, and content- c-means [50], and self-organizing maps [16, 17, 64]. based retrieval (for specifc references, the reader is referred Recent CQ methods are typically based on metaheuristics to our earlier work [8]). or a hybrid of metaheuristics and preclustering/postcluster- A large number of CQ methods have been developed over ing methods. These methods cast CQ as a global optimiza- the past four decades. These methods can be categorized into tion problem, which they then solve by means of a variety two groups: preclustering (hierarchical clustering) methods of physics- or nature-inspired metaheuristics. Metaheuristics and postclustering (partitional clustering) methods [7]. The applied to CQ to date include physics-inspired single-solu- former methods recursively fnd nested clusters either in a tion-based metaheuristics such as simulated annealing [42] top-down (divisive) or bottom-up (agglomerative) fashion. as well as nature-inspired population-based metaheuristics In contrast, the latter ones fnd all the clusters simultane- such as evolutionary algorithms (genetic algorithms [51], ously as a partition of the data and do not impose a hierarchi- evolution strategies [24], diferential evolution [31, 48, 54], cal structure [32]. Compared to preclustering methods, post- etc.) and swarm intelligence algorithms (particle swarm clustering methods generally produce better results, that is, optimization [39], ant colony optimization [43, 44], artifcial lower distortion, but are computationally more demanding. bee colony optimization [41], artifcial fsh swarm optimiza- tion [20], etc.). These methods are more powerful than pre- clustering/postclustering methods in that they can optimize * M. Emre Celebi nonsmooth, nonconvex objective functions. Unfortunately, [email protected] these “black-box” methods have several major disadvan- 1 Department of Computer Science, University of Central tages. First they are generally randomized. Second, they Arkansas, Conway, AR, USA Vol.:(0123456789)1 3 1610 Journal of Real-Time Image Processing (2020) 17:1609–1624 have several parameters that are often difcult to fne tune Bregman divergences are a family of nonmetric dissimilar- (initial/fnal temperature, cooling schedule, population size, ity functions that include the squared Euclidean distance 2 crossover/mutation probability, etc.) Third, due to the vast ( 2 ), squared Mahalanobis distance, Kullback–Leibler search space in CQ applications, they often require a large divergence, and Itakura–Saito divergence. In practice, the number of iterations, which renders them computationally most popular Bregman divergence is the squared Euclidean demanding (they can be orders of magnitude slower than distance, in which case the SE is called the sum of squared k-means). We should, however, mention that some of the error (SSE). For any Bregman divergence, it can be shown C recent studies on metaheuristics-based CQ report signif- that the optimal center i for cluster i is given by the cen- cantly reduced computational requirements [44]. troid (or center of mass) of the cluster [1], that is In this paper, we present an efective and efcient clus- 1 tering method for CQ. The rest of the paper is organized as i = , ni C (2) follows. Section 2 describes two variants of the k-means ∈ i clustering algorithm and the proposed k-means based CQ C method. Section 3 presents the experimental setup and com- where ni denotes the cardinality of cluster i , that is, the C pares the proposed method to other CQ methods. Finally, number of points that belong to i. Section 4 gives the conclusions. The pseudocode for the k-means algorithm is given below. Starting with a set of K centers, the algorithm alter- nates between two steps. First, each point is assigned to the 2 k‑means clustering for CQ nearest cluster. Each cluster center is then recomputed to be the centroid of all points that belong to that cluster. Together, In this section, we frst describe two common variants of these two steps are referred to as a “Lloyd iteration”. These the k-means clustering algorithm, one due to Lloyd and the iterations continue until cluster memberships of points no other due to MacQueen. We then elaborate on the proposed longer change. CQ method based on MacQueen’s k-means algorithm. 1. Let { 1, … , K} be the initial set of centers. C 2.1 Lloyd’s k‑means algorithm 2. For each i ∈{1, … , K} , set cluster i to be the set of X points in that are closer in terms of d to i than they Lloyd’s algorithm [35] is perhaps the most common cluster- are to any other center, that is, ing algorithm in scientifc and engineering applications [14]. C X i = ∈ : d( , i) ≤ d( , ̂� ), for all ̂� ≠ i. Commonly referred to as (batch) k-means, Lloyd’s algorithm 1 X ℝD i ∈{1, … , K} C starts with a data set ={ 1, … , N } ⊆ and a posi- 3. For each , set the center i of cluster i to C tive integer value K, which denotes the desired number of be the centroid of all points in i using Eq. (2). clusters. In the context of CQ, the parameters N, D, and K 4. Repeat Lloyd iterations, that is, steps (2) and (3), until correspond, respectively, to the number of pixels in the input convergence. image, number of color channels (which is typically three), and number of colors desired by the user. The algorithm then The most crucial aspect of k-means is step 1, that is, ini- assigns each data point ∈ X to the closest cluster, thereby tialization. A common way to determine the initial cluster X minimizing the sum of error (SE) given by centers is to select K points uniformly at random from and take these as the initial cluster centers. Unfortunately, SE = d( , { 1, … , K}), k-means is relatively sensitive to initialization [13]. Some of X (1) ∈ the negative efects of improper initialization include empty where d( , { 1, … , K}) denotes the Bregman divergence of clusters, slower convergence, and a higher probability of to the nearest center in { 1, … , K} , that is getting stuck at a poor local minimum. d( , { 1, … , K}) = min d( , i). i∈{1,…,K} 2.2 MacQueen’s k‑means algorithm MacQueen [36] proposed an online formulation of the batch k-means algorithm. The two k-means algorithms are similar in the sense that each point is assigned to the cluster repre- 1 Strictly speaking, in practice, data sets are not implemented as sented by the nearest center to that point. The algorithms, sets, but as sequences, where elements are ordered and duplicates are allowed. This is especially true for image data sets, which are often however, difer in the way the cluster centers are recom- stored and accessed in raster order. puted. The online algorithm updates the nearest center after 1 3 Journal of Real-Time Image Processing (2020) 17:1609–1624 1611 (t) the presentation of each point, whereas the batch algorithm t (t = 1, 2, … ) and be the corresponding nearest (winner) updates all centers after the presentation of the entire set unit according to the 2 distance. The adaptation equation for (t) of points. is given by The pseudocode for the online k-means algorithm is given (t+1) = (t) + (t)((t) − (t)), below. It can be seen that, unlike the batch algorithm, this (3) algorithm does not generate the centroidal Voronoi diagram where ∈[0, 1] is the learning rate, which is typically a itself, but approximate positions of its generators [19]. This monotonically decreasing function of time. The larger the allows the algorithm to achieve O(DK) per-iteration time value, the more emphasis is given to the new input (this complexity, compared to the O(NDK) time complexity of is more obvious in Eq. 5) and hence the faster the learning.
Recommended publications
  • The Effects of Learning Rate Schedules on Color Quantization
    THE EFFECTS OF LEARNING RATE SCHEDULES ON COLOR QUANTIZATION by Garrett Brown A thesis presented to the Department of Computer Science and the Graduate School of the University of Central Arkansas in partial fulfillment of the requirements for the degree of Master of Science in Computer Science Conway, Arkansas May 2020 ProQuest Number:27834147 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent on the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. ProQuest 27834147 Published by ProQuest LLC (2020). Copyright of the Dissertation is held by the Author. All Rights Reserved. This work is protected against unauthorized copying under Title 17, United States Code Microform Edition © ProQuest LLC. ProQuest LLC 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, MI 48106 - 1346 TO THE OFFICE OF GRADUATE STUDIES: The members of the Committee approve the thesis of Garrett Brown presented on March 31, 2020. M. Emre Celebi, Ph.D., Committee Chairperson Ahmad Patooghy, Ph.D. Mahmut Karakaya, Ph.D. PERMISSION Title The Effects of Learning Rate Schedules on Color Quantization Department Computer Science Degree Master of Science In presenting this thesis/dissertation in partial fulfillment of the requirements for a graduate degree from the University of Central Arkansas, I agree that the Library of this University shall make it freely available for inspections. I further agree that permission for extensive copying for scholarly purposes may be granted by the professor who supervised my thesis/dissertation work, or, in the professor’s absence, by the Chair of the Department or the Dean of the Graduate School.
    [Show full text]
  • A New Steganographic Method for Palette-Based Images
    IS&T's 1999 PICS Conference A New Steganographic Method for Palette-Based Images Jiri Fridrich Center for Intelligent Systems, SUNY Binghamton, Binghamton, New York Abstract messages. The gaps in human visual and audio systems can be used for information hiding. In the case of images, the In this paper, we present a new steganographic technique human eye is relatively insensitive to high frequencies. This for embedding messages in palette-based images, such as fact has been utilized in many steganographic algorithms, GIF files. The new technique embeds one message bit into which modify the least significant bits of gray levels in one pixel (its pointer to the palette). The pixels for message digital images or digital sound tracks. Additional bits of embedding are chosen randomly using a pseudo-random information can also be inserted into coefficients of image number generator seeded with a secret key. For each pixel transforms, such as discrete cosine transform, Fourier at which one message bit is to be embedded, the palette is transform, etc. Transform techniques are typically more searched for closest colors. The closest color with the same robust with respect to common image processing operations parity as the message bit is then used instead of the original and lossy compression. color. This has the advantage that both the overall change The steganographer’s job is to make the secretly hidden due to message embedding and the maximal change in information difficult to detect given the complete colors of pixels is smaller than in methods that perturb the knowledge of the algorithm used to embed the information least significant bit of indices to a luminance-sorted palette, except the secret embedding key.* This so called such as EZ Stego.1 Indeed, numerical experiments indicate Kerckhoff’s principle is the golden rule of cryptography and that the new technique introduces approximately four times is often accepted for steganography as well.
    [Show full text]
  • Certified Digital Designer Professional Certification Examination Review
    Digital Imaging & Editing and Digital & General Photography Certified Digital Designer Professional Certification Examination Review Within this presentation – We will use specific names and terminologies. These will be related to specific products, software, brands and trade names. ADDA does not endorse any specific software or manufacturer. It is the sole decision of the individual to choose and purchase based on their personal preference and financial capabilities. the Examination Examination Contain at Total 325 Questions 200 Questions in Digital Image Creation and Editing Image Editing is applicable to all Areas related to Digital Graphics 125 Question in Photography Knowledge and History Photography is applicable to General Principles of Photography Does not cover Photography as a General Arts Program Examination is based on entry level intermediate employment knowledge Certain Processes may be omitted that are required to achieve an end result ADDA Professional Certification Series – Digital Imaging & Editing the Examination Knowledge of Graphic and Photography Acronyms Knowledge of Graphic Program Tool Symbols Some Knowledge of Photography Lighting Ability to do some basic Geometric Calculations Basic Knowledge of Graphic History & Theory Basic Knowledge of Digital & Standard Film Cameras Basic Knowledge of Camera Lens and Operation General Knowledge of Computer Operation Some Common Sense ADDA Professional Certification Series – Digital Imaging & Editing This is the Comprehensive Digital Imaging & Editing Certified Digital Designer Professional Certification Examination Review Within this presentation – We will use specific names and terminologies. These will be related to specific products, software, brands and trade names. ADDA does not endorse any specific software or manufacturer. It is the sole decision of the individual to choose and purchase based on their personal preference and financial capabilities.
    [Show full text]
  • A Framework for Detection of Linear Gradient Filled Regions and Their
    21st International Conference on Computer Graphics, Visualization and Computer Vision 2013 A framework for detection of linear gradient filled regions and their reconstruction for vector graphics Ruchin Kansal Prof. Subodh Kumar Department of computer Department of computer science science IIT Delhi IIT Delhi India, New Delhi India, New Delhi [email protected] [email protected] ABSTRACT Linear gradients are commonly applied in non-photographic art-work for shading and other artistic effects. It is sometimes necessary to generate a vector graphics form of raster images comprising such art-work with the expec- tation to obtain a simple output that is further editable, say, using any popular vector editing software. Further, this vectorization process should be automatic with minimal user intervention. We present a simple image vectorization algorithm that meets these requirements by detecting linear gradient filled regions and reconstructing the vector definition using that information. It uses a novel interval gradient optimization scheme to derive large regions of uniform gradient. We also demonstrate the technique on noisy images. Keywords Image vectorization Gradient reconstruction Region detection With the increasing demand for vector content, the need for converting raster images into vectors has also in- creased. This process is called Vectorization. We set out to obtain automatic vectorization with minimal human intervention even on potentially noisy images. Later re-rasterization of our vector representation should pro- duce the appearance of the original image at various (a) Original Image (b) Adobe Livetrace scales. Further, for wide-spread usage, this vector rep- output resentation should be in a standard form and be ed- itable.
    [Show full text]
  • Color Representation and Quantization
    Lecture 2 Color Representation and Quantization Yao Wang Polytechnic Institute of NYU, Brooklyn, NY 11201 With contributions from Zhu Liu, AT&T Labs Most figures from Gonzalez/Woods: Digital Image Processing Lecture Outline • Color perception and representation – Human ppperception of color – Trichromatic color mixing theory – Different color representations • ClColor image dildisplay – True color image – Indexed color images • Pseudo color images • Quantization Fundamentals – Uniform, non-uniform, dithering • Color quantization Yao Wang, NYU-Poly EL5123: color and quantization 2 Light is part of the EM wave Yao Wang, NYU-Poly EL5123: color and quantization 3 Illuminating and Reflecting Light • Illuminating sources (primary light): – emit light (e.g. the sun, light bulb, TV monitors) – perceived color depends on the emitted freq. – follow additive rule • R+G+B=White • Reflecting sources (secondary light): – reflect an incoming light (e.g. the color dye, matte surface, cloth) – perceived color depends on reflected freq (=emitted freq-absorbed freq.) – follow subtractive rule • R+G+B=Black Yao Wang, NYU-Poly EL5123: color and quantization 4 Eye vs. Camera Camera components Eye components Lens Lens, cornea Shutter Iris, pupil Film RtiRetina Cable to transfer images Optic nerve send the info to the brain Yao Wang, NYU-Poly EL5123: color and quantization 5 Human Perception of Color • Retina contains receptors – Cones • Day vision, can perceive color tone • Red, green, and blue cones – Rods • Night vision, perceive brightness only • Color sensation
    [Show full text]
  • Color Quantization and Its Impact on Color Histogram Based Image Retrieval
    Color Quantization and its Impact on Color Histogram Based Image Retrieval Khouloud Meskaldji1, Samia Boucherkha2 et Salim Chikhi3 [email protected],2 [email protected],3 [email protected] Abstract. The comparison of color histograms is one of the most widely used techniques for Content-Based Image Retrieval. Before establishing a color histogram in a defined model (RGB, HSV or others), a process of quantization is often used to reduce the number of used colors. In this paper, we present the results of an experimental investigation studying the impact of this process on the accuracy of research results and thus will determine the number of intensities most appropriate for a color quantization for the best accuracy of research through tests applied on an image database of 500 color images. Keywords: histogram-based image retrieval, color quantization, color model, precision, Histogram similarity measures. 1 Introduction Content-based image retrieval (CBIR) is a technique that uses visual attributes (such as color, texture, shape) to search for images. These attributes are extracted directly from the image using specific tools and then stored on storage media. The Search in this case is based on a comparison process between the visual attributes of the image request and those of the images of the base (Fig.1). This technique appeared as an answer to an eminent need of techniques of automatic annotaion of images. Manual annotation of images is an expensive, boring, subjective, sensitive to the context and incomplete task. Color attributes are considered among the most used low-level features for image search in large image databases.
    [Show full text]
  • Fast Color Quantization by K-Means Clustering Combined with Image Sampling
    S S symmetry Article Fast Color Quantization by K-Means Clustering Combined with Image Sampling Mariusz Frackiewicz * , Aron Mandrella and Henryk Palus Institute of Automatic Control, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland * Correspondence: [email protected]; Tel.: +48-32-2371066 Received: 18 May 2019; Accepted: 17 July 2019; Published: 1 August 2019 Abstract: Color image quantization has become an important operation often used in tasks of color image processing. There is a need for quantization methods that are fast and at the same time generating high quality quantized images. This paper presents such color quantization method based on downsampling of original image and K-Means clustering on a downsampled image. The nearest neighbor interpolation was used in the downsampling process and Wu’s algorithm was applied for deterministic initialization of K-Means. Comparisons with other methods based on a limited sample of pixels (coreset-based algorithm) showed an advantage of the proposed method. This method significantly accelerated the color quantization without noticeable loss of image quality. The experimental results obtained on 24 color images from the Kodak image dataset demonstrated the advantages of the proposed method. Three quality indices (MSE, DSCSI and HPSI) were used in the assessment process. Keywords: color image; color quantization; clustering; K-Means; sampling; image quality 1. Introduction Color image quantization plays an auxiliary, but still very important role in such tasks of color image processing as image compression, image segmentation, image watermarking, etc. This operation significantly reduces the number of colors in the image and at the same time it should maintain a quantized image similar to the original image.
    [Show full text]
  • Fast Color Quantization Using Macqueen's K-Means Algorithm
    FAST COLOR QUANTIZATION USING MACQUEEN’S K-MEANS ALGORITHM by Skyler Thompson A thesis presented to the Department of Computer Science and the Graduate School of the University of Central Arkansas in partial fulfillment of the requirements for the degree of Master of Science in Computer Science Conway, Arkansas May 2020 ProQuest Number:27744999 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent on the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. ProQuest 27744999 Published by ProQuest LLC (2020). Copyright of the Dissertation is held by the Author. All Rights Reserved. This work is protected against unauthorized copying under Title 17, United States Code Microform Edition © ProQuest LLC. ProQuest LLC 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, MI 48106 - 1346 TO THE OFFICE OF GRADUATE STUDIES: The members of the Committee approve the thesis of ______________________________________Skyler Thompson presented on ___________________________________________________________03/03/2020 . Digitally signed by M. Emre Celebi _______________________________________M. Emre Celebi Date: 2020.03.21 10:48:41 -05'00' Committee Chairperson Digitally signed by Sinan Kockara ______________________________________Sinan Kockara Date: 2020.03.23 15:49:33 -05'00' Committee Member Digitally signed by Yu Sun DN: cn=Yu Sun, o=University of Central Arkansas, ou=Computer Yu Sun Science Dept., [email protected], c=US ______________________________________Date: 2020.03.23 16:31:32 -05'00' Committee Member ______________________________________ Committee Member ______________________________________ Committee Member © 2020 Skyler Thompson iv ACKNOWLEDGEMENT I must first and foremost thank God for providing me with the knowledge and guidance to get through my several years in college.
    [Show full text]
  • Gif2video: Color Dequantization and Temporal Interpolation of GIF Images
    GIF2Video: Color Dequantization and Temporal Interpolation of GIF images Yang Wang1, Haibin Huang2†, Chuan Wang2, Tong He3, Jue Wang2, Minh Hoai1 1Stony Brook University, 2Megvii Research USA, 3UCLA, †Corresponding Author Abstract Color Color Quantization Dithering Graphics Interchange Format (GIF) is a highly portable graphics format that is ubiquitous on the Internet. De- !"#"$ %&#'((' spite their small sizes, GIF images often contain undesir- )$$"$ *+,,-.+"/ able visual artifacts such as flat color regions, false con- tours, color shift, and dotted patterns. In this paper, we propose GIF2Video, the first learning-based method for en- hancing the visual quality of GIFs in the wild. We focus Artifacts: Artifacts: 1. False Contour 4. Dotted Pattern on the challenging task of GIF restoration by recovering 2. Flat Region 3. Color Shift information lost in the three steps of GIF creation: frame sampling, color quantization, and color dithering. We first Figure 1. Color quantization and color dithering. Two major propose a novel CNN architecture for color dequantization. steps in the creation of a GIF image. These are lossy compression It is built upon a compositional architecture for multi-step processes that result in undesirable visual artifacts. Our approach is able to remove these artifacts and produce a much more natural color correction, with a comprehensive loss function de- image. signed to handle large quantization errors. We then adapt the SuperSlomo network for temporal interpolation of GIF frames. We introduce two large datasets, namely GIF-Faces and GIF-Moments, for both training and evaluation. Ex- temporal resolution of the image sequence by using a mod- perimental results show that our method can significantly ified SuperSlomo [20] network for temporal interpolation.
    [Show full text]
  • Color Image Quantization: a Short Review and an Application with Artificial Bee Colony Algorithm
    INFORMATICA, 2014, Vol. 25, No. 3, 485–503 485 2014 Vilnius University DOI: http://dx.doi.org/10.15388/Informatica.2014.25 Color Image Quantization: A Short Review and an Application with Artificial Bee Colony Algorithm Celal OZTURK∗, Emrah HANCER, Dervis KARABOGA Department of Computer Engineering, Erciyes University, Kayseri, 38039, Turkey e-mail: [email protected], [email protected], [email protected] Received: December 2012; accepted: July 2013 Abstract. Color quantization is the process of reducing the number of colors in a digital image. The main objective of quantization process is that significant information should be preserved while reducing the color of an image. In other words, quantization process shouldn’t cause significant information loss in the image. In this paper, a short review of color quantization is presented and a new color quantization method based on artificial bee colony algorithm (ABC) is proposed. The performance of the proposed method is evaluated by comparing it with the performance of the most widely used quantization methods such as K-means, Fuzzy C Means (FCM), minimum variance and particle swarm optimization (PSO). The obtained results indicate that the proposed method is superior to the others. Key words: color quantization, artificial bee colony, particle swarm optimization, K-means, fuzzy C means. 1. Introduction It is known that high quality images can be displayed and stored without much effort with the rapid development of computer software and hardware. However, these images can contain a huge amount of detailed information causing transfer time and processing problems. In order to avoid these problems, unnecessaryinformationshould be eliminated from images with some pre-processing methods before processing and transmission.
    [Show full text]
  • Color Image Quantization and Dithering Method Based on Human Visual System Characteristics*
    Color Image Quantization and Dithering Method Based on Human Visual System Characteristics* Kyeong Man Kim, Chae Soo Lee, Eung Joo Lee, and Yeong Ho Ha Department of Electronic Engineering, Kyungpook National University Taegu 702-701, Korea Abstract of an initially selected palette. Variations on this idea include the manner in which the initial palette is chosen New methods for both color palette design and dither- and the color space in which the quantization is per- ing based on human visual system (HVS) characteris- formed. The refinement algorithm, commonly known as tics are proposed. Color quantization for palette design the K-means or Linde–Buzo–Gray (LBG) algorithm,1 uses the relative visual sensitivity and spatial masking is a vector extension of the Lloyd quantizer for scalars. effect of HVS. The dithering operation for printing uses The algorithm seeks to reduce the total squared error nonlinear quantization, which considers the overlapping (TSE) between the original and the quantized image at phenomena among neighbor printing dots, and then a each iteration until a (local) minimum is found. This modified dot-diffusion algorithm is followed to compen- method yields high-quality images and, with a properly sate the degradation produced in the quantization pro- chosen initial palette, will result in the lowest TSE for a cess. The proposed techniques can produce high-quality given palette size. The method is, however, images in low-bit color devices. computationally intensive, and its performance is sen- sitive to the choice of the initial palette. Also there are Introduction splitting algorithms2,3,6 that divide the color space into disjoint regions and pick a representative color from each Recently, the use of color image data has been growing region as a palette color.
    [Show full text]
  • Gifnets: Differentiable GIF Encoding Framework
    GIFnets: Differentiable GIF Encoding Framework Innfarn Yoo, Xiyang Luo, Yilin Wang, Feng Yang, and Peyman Milanfar Google Research - Mountain View, California [innfarn, xyluo, yilin, fengyang, milanfar]@google.com Abstract Graphics Interchange Format (GIF) is a widely used im- age file format. Due to the limited number of palette col- ors, GIF encoding often introduces color banding artifacts. Traditionally, dithering is applied to reduce color banding, but introducing dotted-pattern artifacts. To reduce artifacts and provide a better and more efficient GIF encoding, we introduce a differentiable GIF encoding pipeline, which in- cludes three novel neural networks: PaletteNet, DitherNet, and BandingNet. Each of these three networks provides an (a) Original (b) Palette important functionality within the GIF encoding pipeline. PaletteNet predicts a near-optimal color palette given an input image. DitherNet manipulates the input image to re- duce color banding artifacts and provides an alternative to traditional dithering. Finally, BandingNet is designed to detect color banding, and provides a new perceptual loss specifically for GIF images. As far as we know, this is the first fully differentiable GIF encoding pipeline based on deep neural networks and compatible with existing GIF de- coders. User study shows that our algorithm is better than Floyd-Steinberg based GIF encoding. (c) Floyd-Steinberg (d) Our Method Figure 1: Comparison of our method against GIF encod- 1. Introduction ing with no dithering and Floyd-Steinberg dithering. Com- pared to GIF without dithering (b) and Floyd-Steinberg (c), GIF is a widely used image file format. At its core, GIF our method (d) shows less banding artifacts as well as less represents an image by applying color quantization.
    [Show full text]