Lossless Data Compression of Grid-Based Digital Elevation Models: a Png Image Format Evaluation
Total Page:16
File Type:pdf, Size:1020Kb
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume II-5, 2014 ISPRS Technical Commission V Symposium, 23 – 25 June 2014, Riva del Garda, Italy LOSSLESS DATA COMPRESSION OF GRID-BASED DIGITAL ELEVATION MODELS: A PNG IMAGE FORMAT EVALUATION G. Scarmana University of Southern Queensland, Australia [email protected] Commission V KEY WORDS: Image processing; DEM compression; Terrain modelling; PNG (Portable Networks Graphics). ABSTRACT: At present, computers, lasers, radars, planes and satellite technologies make possible very fast and accurate topographic data acquisition for the production of maps. However, the problem of managing and manipulating this data efficiently remains. One particular type of map is the elevation map. When stored on a computer, it is often referred to as a Digital Elevation Model (DEM). A DEM is usually a square matrix of elevations. It is like an image, except that it contains a single channel of information (that is, elevation) and can be compressed in a lossy or lossless manner by way of existing image compression protocols. Compression has the effect of reducing memory requirements and speed of transmission over digital links, while maintaining the integrity of data as required. In this context, this paper investigates the effects of the PNG (Portable Network Graphics) lossless image compression protocol on floating-point elevation values for 16-bit DEMs of dissimilar terrain characteristics. The PNG is a robust, universally supported, extensible, lossless, general-purpose and patent-free image format. Tests demonstrate that the compression ratios and run decompression times achieved with the PNG lossless compression protocol can be comparable to, or better than, proprietary lossless JPEG variants, other image formats and available lossless compression algorithms. 1. INTRODUCTION observations and DEM-reconstructed elevation values (Owen and Grigg, 2004). Traditionally DEM errors are reported by DEMs are generally used to describe the surface of the earth or summary statistics, using a single value, such as the Root Mean planets, and virtual worlds in video games. They are produced Square Error (RMSE), which quantifies the average deviation in a number of ways, most of them by direct field measurements between ground-based observation and DEM values at a set of of elevations at specific locations using for example LIDAR control locations. These statistics are global measures of DEMs (Light Detection and Ranging), photogrammetry or accuracy and are not specific to any particular location. Interferometric Synthetic Aperture Radar (Fujisada et al., 2012). Therefore it is assumed that the error rates are similar everywhere, from the highest sub areas to the flattest ones. DEMs represent a continuous surface and depending on the application they can be very dense, with grid point distances However, if the sampling data is evenly and densely distributed ranging from one metre to less than one hundred metres, thus (as it is the case of the DEMs considered in this work) over the incorporating large amounts of data (Poli and Soille, 2011). It grid, these error measures may indeed be indicative of DEMs should be pointed out that this work relates to rectangular, accuracy. A complete evaluation of accuracy assessment of regularly sampled elevation datasets. Other DEM DEMs is beyond the scope of this work which basically relates representations include irregularly sampled elevation points to the compression of the DEM once the DEM is generated or stored in a TIN (Triangulated Irregular Network) and the constructed according to specific accuracy requirements, contour representation (Maune, 2007). Spatial data structures applications and uses. For a comprehensive study of local and such as the quadtree of Samet (Samet, 1987). can also be used global probabilistic accuracy assessments of DEMs the reader is for constructing and/or representing a DEM. referred to Harvey (2008). A basic processes operating on DEMs is data compression. DEM data can be very dense it requires relatively high memory Compressing a DEM reduces the storage space and facilitates for its storage. Costs for storage space may be considered as no faster access to the data as well as faster transfer. Image longer crucial, since the price for storage media has been almost compression algorithms are relevant to DEM compression exponentially decreased (at present <$50 per Terabyte). because they deal with comparable issues. Image compression Nonetheless, more significant are system dependent parameters, algorithms target natural images, where the information loss like bandwidth and CPU speed. Time is crucial for Internet and brought by imperfect reconstruction is usually not a problem real-time applications. The largest bottleneck in modern because It is the human visual perception that is being targeted computers is the transfer of data in the busses between by the compression process. This is not the case of DEMs. In memories (primary and secondary) and processors (central and fact, users of DEMs are very reluctant to use altered datasets. graphic processors). Transferring large amounts of data using Some issues are: altered slope, altered hydrology and altered external communication (cables or wireless interfaces) is even visibility. more critical. Processors are sometimes so fast that the time for transferring and decoding data may be faster than transferring In addition, grid-based DEMs are usually estimated from non-coded data (Yea and Pearlman, 2004). interpolated values. These estimates will always be affected by several sources of errors (sampling, measurement, interpolation Very dense DEM data may preclude numerous redundancies, methods, etc.), and an inevitable disparity will occur between which can be encoded efficiently with modern image processing This contribution has been peer-reviewed. The double-blind peer-review was conducted on the basis of the full paper. doi:10.5194/isprsannals-II-5-313-2014 313 ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume II-5, 2014 ISPRS Technical Commission V Symposium, 23 – 25 June 2014, Riva del Garda, Italy compression techniques (Moshe et al, 2007). Hence, this paper Indeed, very few image formats are suitable for representing a evaluates the effects of the PNG (Portable Network Graphics) DEM as a raster image with sufficient accuracy and resolution. lossless compression protocol on floating-point elevation values The majority of these papers deal with DEM compression by for 16-bit DEMs of dissimilar terrain characteristics. Since using variants of the JPEG compression algorithm. By way of DEMs can be displayed and processed using an image format, example, Shantanu and Sapiro (2001) investigated the lossless they can be read, exported and imported directly by a scientific compression of terrain images using the JPEG-LS standard application supporting that format and, therefore, can be whereas Bjorke and Nilson (2002) and Oimoen (2005) detailed processed in a lossy or lossless compressed form as required wavelets based compression schemes for terrain images. This (Kidner and Smith, 2003). scheme support elevation query mechanisms while allowing the compression and/or decompression of specific terrain areas of The majority of the research in the compression of raster images interest within an elevation range. for DEMs representation has focused on lossy compression algorithms. Lossy compression schemes may only achieve Owen and Grigg. (2004) demonstrated the use of JPEG2000 for modest compression before significant information is lost. Even compressing and querying DEMs, and provided comparisons if greater compression could be achieved, and if some loss of with compression utilities such as winzip and winrar. Moshe data is acceptable, there is still controversy over the role of and Shamir (2007) presented an image compression terrain lossy compression for particular applications (Isenburg et al. generalisation algorithm based on Discrete Cosine Transforms 2004). Indeed, if only mild levels of lossy compression are (DCT) and Discrete Wavelets Transforms (DWT) that were attained, then significantly improved lossless compression specifically adjusted to fit DEMs. Bjørke and Nilsen (2003) techniques might be more appropriate. carried out a similar work using Wavelets Transforms as applied to the simplification and compression of digital terrain Furthermore, complex compression schemes are more costly to models. Alternatively, Hilbring (2004) used 16-bits PNG develop, implement, and deploy, and the use of proprietary images in the integration of high-resolution DEMs into 3D GIS schemes (i.e. JPEG2000) may have a cost (and risk) associated applications for environmental systems. with the end of life of equipment especially for long-term archives and storage (Remondino, 2003). Proprietary Other research papers have studied the compression of floating- compression schemes may also compromise interaction among point 3D data using image formats by focusing on maximising different softwares. Hence, the use of popular consumer the compression ratio as the decompression speed was not industry standards such as the PNG image format can reduce the relevant. For example, Usevitch (2003) and Gamito and Dias cost and risk of using lossless image compression for efficient (2004) proposed extensions and modifications to the JPEG2000 storage and representation of mass generated DEMs. standard that allows floating-point data