Technical Note
Gap-Filling of Landsat 7 Imagery Using the Direct Sampling Method
Gaohong Yin 1,*, Gregoire Mariethoz 2 and Matthew F. McCabe 1
1 Water Desalination and Reuse Center, Division of Biological and Environmental Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia; [email protected]
2 Institute of Earth Surface Dynamics, University of Lausanne, Lausanne 1015, Switzerland; [email protected]
* Correspondence: [email protected]
Academic Editors: Richard Müller and Prasad S. Thenkabail Received: 17 July 2016; Accepted: 14 December 2016; Published: 27 December 2016
Abstract: The failure of the Scan Line Corrector (SLC) on Landsat 7 imposed systematic data gaps on retrieved imagery and removed the capacity to provide spatially continuous fields. While a number of methods have been developed to fill these gaps, most of the proposed techniques are only applicable over relatively homogeneous areas. When they are applied to heterogeneous landscapes, retrieving image features and elements can become challenging. Here we present a gap-filling approach based on the adoption of the Direct Sampling multiple-point geostatistical method. The method employs a conditional stochastic resampling of known areas in a training image to simulate unknown locations. The approach is assessed across a range of both homogeneous and heterogeneous regions. Simulation results show that for homogeneous areas, satisfactory results can be obtained by simply adopting non-gap locations in the target image as baseline training data. For heterogeneous landscapes, bivariate simulations using an auxiliary variable acquired at a different date provide more accurate results than univariate simulations, especially as land cover complexity increases. Apart from recovering spatially continuous fields, one of the key advantages of the Direct Sampling is its relatively straightforward implementation, which relies on few parameters.
Keywords: Landsat ETM+; gap filling; multiple-point geostatistics; Scan Line Corrector; SLC
Remote Sens. 2017, 9, 12; doi:10.3390/rs9010012 www.mdpi.com/journal/remotesensing

1. Introduction
The Landsat 7 satellite was launched in April 1999 with the aim of providing timely, high-quality visible and infrared images of the Earth’s surface, while fulfilling the goal of mission data continuity established by preceding platforms [1]. One of the principal objectives of the Landsat Program, a joint effort of the U.S. Geological Survey (USGS) and the National Aeronautics and Space Administration (NASA), was to develop a long-term and spatially consistent record of natural and human-induced changes on the global landscape [2]. The Landsat program has been successful in this task and has played a foundational role across a diverse range of studies, with focus areas in agriculture, forestry, water quality and geology, to name just a few. Unfortunately, the Scan Line Corrector (SLC; an electromechanical device that compensates for the forward motion of the satellite) on Landsat 7’s Enhanced Thematic Mapper Plus (ETM+) instrument failed on 31 May 2003. Consequently, the once spatially continuous fields observed by the sensor were affected by a line-of-sight zig-zag pattern, resulting in large gaps and around 22% of pixels missing in collected data. While the instrument still provides usable and accurate data, the utility of the observations was significantly impaired. In order to assess the scale of the SLC problem, scientists were gathered to evaluate the utility and validity of Landsat 7 products after the SLC failure [3]. Overall, it was found that for applications operating at large spatial scales, such as land cover change and qualitative assessments of crops, the effect of the anomaly was slight or tolerable, while for applications requiring complete and focused retrievals over localized areas, such as detailed mapping or event monitoring, the anomaly could not be ignored [3]. Ultimately, finding a method to satisfactorily fill the gaps was necessary. To address this task, a variety of techniques have been proposed, with the Local Linear Histogram Matching (LLHM) approach [4,5] being one of the first considered by the USGS. In this approach (and in subsequent examples used herein), the image containing gaps is referred to as the target image, while the images that provide information to fill these gaps are referred to as input images. The LLHM method uses a geometrically consistent Landsat 7 image collected prior to the anomaly as an input image and attempts to find the linear transformation of the digital number (DN) of the non-gap locations between the input image and target image. While the method is conceptually simple, the results are in general only passable. Some striping appears around land cover changes, and artifacts occur around small sharp edges, which generally limits the usage of LLHM to homogeneous land types. To address the limitations inherent in the LLHM method, Maxwell et al. [6] introduced an approach to fill the gaps based on a multi-scale segmentation model. The idea of segmentation is to group individual pixels into a series of size groups (scales 10, 15 and 20 were used in Maxwell et al. 2007) according to their spectral and spatial characteristics [7].
However, like the LLHM, the approach is most suited to homogeneous areas, as it is not able to depict the characteristics of more heterogeneous landscapes. Pringle et al. [8] proposed geostatistical interpolation to fill the gaps, recommending the co-kriging algorithm to predict the missing values [9,10]. Two cloud-free input images are used, one taken before and another after the target image. Ideally, the two input images should be temporally close to the target image. However, the utility of this approach is reduced when the availability of temporally close gap-free imagery is limited. Further, the implementation of geostatistical approaches is generally based on the assumption of a stationary spatial process, which is not plausible when land cover is expected to change dynamically in both space and time. In more recent work, Chen et al. [11] proposed a deterministic approach, referred to as the Neighborhood Similar Pixel Interpolator (NSPI), to improve prediction performance over heterogeneous areas. The NSPI assumes that neighborhood pixels close to an un-scanned (i.e., gap) pixel share similar spectral characteristics and temporal patterns of change with the un-scanned pixel(s). By combining the information provided by the neighborhood pixels, the value of the missing pixels can be determined. Results have shown that the NSPI reconstructs spatially continuous features far better than previous methods. However, one issue requiring consideration is that the NSPI is a deterministic approach: the question of how the input images affect the fidelity of the final results remains open, which may significantly weaken the reliability of the simulation results. To address this problem, Zhu et al. [12] introduced the Geostatistical Neighborhood Similar Pixel Interpolation (GNSPI), which improved the NSPI by incorporating geostatistical theory and demonstrated more accurate results than the NSPI alone.
GNSPI divides the reflectance value of an un-scanned pixel into a temporal trend and a residual, of which the temporal trend is estimated based on the NSPI and the residual is predicted through an ordinary kriging algorithm. Although the simulation results from the GNSPI make it one of the best performing methods to date, its relatively complicated operation as well as computational constraints limit its application. Zeng et al. [13] proposed a weighted linear regression (WLR) algorithm integrated with Laplacian prior regularization to recover missing pixels in SLC-off imagery. The WLR algorithm builds a linear regression model between the corresponding pixels of multi-temporal images to fill the majority of missing pixels. The Laplacian prior regularization method, a non-reference regularization method, is then applied to fill those gap pixels that cannot be completely recovered by the WLR algorithm. The integrated approach has been demonstrated to reconstruct complicated landscapes accurately. However, its performance becomes limited when applied to complex landscapes with large data gaps.
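As a rough illustration of the regression idea only, the sketch below fits one global least-squares line between co-registered images and predicts the gap pixels from it. This is our own simplified toy, not Zeng et al.’s WLR, which weights the regression locally and adds Laplacian regularization; the function name and setup are hypothetical.

```python
import numpy as np

def regression_fill(target, reference):
    """Fill NaN gaps in `target` from a co-registered `reference` image
    using a single global linear model target ~ a * reference + b,
    fitted on the pixels observed in both images (toy illustration)."""
    mask = ~np.isnan(target)
    a, b = np.polyfit(reference[mask], target[mask], 1)  # least-squares fit
    filled = target.copy()
    filled[~mask] = a * reference[~mask] + b             # predict the gaps
    return filled
```

A locally weighted variant would instead fit the model within a moving window around each gap pixel, down-weighting spectrally dissimilar neighbors.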
Multiple-point geostatistics (MPS) [14,15] is a modeling approach that uses training images to develop stochastic representations of spatial phenomena. The idea behind multiple-point geostatistics is to infer the multiple-point conditional distribution directly from selected training images and then reconstruct complicated structures using the patterns present in the given training images. The sources of training imagery are varied and can include, for instance, remotely sensed images, experimental data or modeling results. One important concept in the MPS approach is the data event, which is the pattern of values consisting of the n closest neighborhood pixels of the unknown pixel x in the target image. One early MPS algorithm is the method proposed by Guardiano and Srivastava [14]. In this approach, the entire training image is scanned and all training replicates that perfectly match the data event (i.e., training replicates that have the same values as the data event at the corresponding locations) are found and stored. Based on this matching set, the conditional cumulative distribution function (ccdf) of x can be determined and a sample is then randomly drawn from this distribution to fill unknown pixels. One problem impeding the practical application of this algorithm is that it requires a scan of the entire training image to predict the value of each single pixel, which is clearly time-consuming. Strebelle improved the applicability of MPS by introducing the Single Normal Equation Simulation (SNESIM) algorithm [16], which includes a search tree structure in which the training replicates of all data events are recorded and stored, making MPS more practical. Given the difficulty of deriving the ccdf from a continuous training image, the SNESIM algorithm can only be applied to categorical variables.
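The exhaustive-scan idea of Guardiano and Srivastava can be condensed into a few lines. The sketch below is a simplified illustration with hypothetical names (not the original implementation): it scans a categorical training image for every replicate that exactly matches the data event, then draws the unknown value from the empirical ccdf formed by the matched central pixels.

```python
import numpy as np

def ccdf_sample(training, data_event, rng):
    """Scan the whole training image for replicates exactly matching the
    data event, then draw the unknown value from the empirical conditional
    distribution of the matched central pixels.

    data_event: list of (lag_row, lag_col, value) pairs describing the
    known neighbors relative to the unknown pixel."""
    rows, cols = training.shape
    matches = []
    for r in range(rows):
        for c in range(cols):
            ok = True
            for dr, dc, v in data_event:
                rr, cc = r + dr, c + dc
                if not (0 <= rr < rows and 0 <= cc < cols) or training[rr, cc] != v:
                    ok = False
                    break
            if ok:
                matches.append(training[r, c])   # central value of the replicate
    if not matches:
        raise ValueError("no matching replicate in the training image")
    return rng.choice(matches)                   # random draw from the ccdf

# Toy categorical training image (0 = water, 1 = land) and a data event
ti = np.array([[0, 0, 1],
               [0, 1, 1],
               [1, 1, 1]])
event = [(-1, 0, 0), (0, -1, 0)]                 # pixel above = 0, pixel left = 0
rng = np.random.default_rng(0)
print(ccdf_sample(ti, event, rng))               # prints 1 (only one replicate matches)
```

The nested full-image scan per pixel is exactly the cost that SNESIM’s search tree removes by storing the replicate counts in advance.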
A number of alternative methods, including the SIMPAT algorithm [17], the unilateral path algorithm [18] and the FILTERSIM algorithm [19], have also been proposed to deal with both categorical and continuous variables. To date, MPS has been widely used for the reconstruction of complex geological structures [20–22] and, in more recent years, has been applied to remote sensing land cover classification [23,24] as well as downscaling applications [25,26]. As such, the method is anticipated to serve as an efficient approach to fill gaps within remote sensing imagery, caused for instance by cloud cover and orbital characteristics [27] or, as in the case to be explored here, instrument failure. To advance the application of MPS approaches for gap-filling efforts, we explore the use of the Direct Sampling method as proposed in Mariethoz et al. [28]. The Direct Sampling method uses a conditional stochastic resampling of known areas in the training image to simulate the unknown locations in the target image and allows for the generation of multiple and equiprobable stochastic reconstructions. The applicability of Direct Sampling has been demonstrated previously across a range of applications, including the downscaling of climate simulations [29] as well as the simulation of daily rainfall time series [30]. To evaluate the gap-filling potential of Direct Sampling more thoroughly, the approach is assessed against Landsat data covering a range of challenging land cover types, including deserts, sparse agricultural areas, dense farmlands and urban areas, as well as braided rivers and coastal areas. To examine the impact of temporal distance on the fidelity of the simulation results, for each studied land cover type, a series of input images at approximately two-week, four-week, six-week and greater-than-four-month separations from the target image is used to provide gap-filling information.
The influence of the choice of input imagery on prediction accuracy is also considered. The reconstruction results are examined using both quantitative and qualitative descriptors to identify the utility of the Direct Sampling method.
2. Materials and Methods
2.1. Direct Sampling Method
The Direct Sampling method adopted in this paper generates stochastic fields that present complex statistical and spatial properties based on training images [27]. The particular feature of the Direct
Sampling method compared to other MPS algorithms is that it does not demand an explicit computation and storage of the ccdf expressed in Equation (1):
F(z, x, Nx) = Prob(Z(x) ≤ z | Nx) (1)

where Z(x) is the variable of interest, x is the location of an unknown pixel and Nx is the data event consisting of the pattern of values made by the n closest neighbors of x: {x1, x2, ..., xn}. For each unknown pixel x, Nx is made of the values of Z at the neighboring locations, {Z(x1), Z(x2), ..., Z(xn)}, as well as the lag vectors that describe the relative positions between the location x and its neighbors, defined as L = {h1, h2, ..., hn} = {x1 − x, x2 − x, ..., xn − x}. The Direct Sampling method acquires the values of unknown pixels Z(x) by directly sampling the training image in a random manner, attempting to find one pixel y in the training image with a replicate Ny that is similar to Nx [31]. As soon as a pixel y is obtained with a neighborhood similar to Nx, the sampling process stops and the value of y is assigned to x.

To clarify the process of the Direct Sampling method, the gap-filling procedure for a categorical variable is displayed in Figure 1. In this example, Figure 1a is the target image with unknown pixels to be filled, of which the white pixels are unknown and the colored pixels are informed. The filling path over all un-scanned pixels is generated randomly. The pixel denoted by a question mark is the target pixel x to be predicted at the current step, and the data event Nx is defined in Figure 1b. Figure 1c is the training image used to fill the target pixels. The data event from Figure 1b is used to search the training image randomly. The search process continues until one replicate Ny that exactly matches the defined data event Nx is found in the training image. Once the matched replicate Ny is found (Figure 1d, check marks), the search process stops and the value of y is assigned to the target pixel x, as shown in Figure 1e. The new pixel value is then considered known and the procedure (a)–(d) is repeated to simulate the value of another unknown pixel. Each time, the previously simulated pixels are considered in subsequent data events. The order in which pixels are simulated is defined by a random path, which is different for each realization.
Figure 1. The Direct Sampling approach and its use of a training image to predict a missing pixel in the target image. The procedure follows a number of steps, including (a) definition of the simulation grid, where the pixel x represents the pixel to be filled, and the blue, red and yellow pixels represent pixels with known values; (b) definition of the data event; (c) use of a training image to provide information to fill the gaps, with a search employing the data event defined in (b) until (d) the data event is identified in the training image; and (e) the training value corresponding to the first matching data event is assigned to the missing target pixel.
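The procedure of Figure 1 can be sketched as a short loop. The code below is a minimal illustration with hypothetical names and a toy neighborhood definition, not the implementation used in this paper: it visits the gap pixels along a random path and, for each one, scans the training image in random order until a replicate exactly matches the current data event.

```python
import numpy as np

def direct_sampling_categorical(target, training, n_neigh=4, seed=0):
    """Fill NaN gaps in a categorical target image with Direct Sampling:
    visit gaps along a random path and, for each, scan the training image
    in random order until a replicate exactly matches the data event."""
    rng = np.random.default_rng(seed)
    sim = np.array(target, dtype=float)
    rows, cols = sim.shape
    gaps = list(zip(*np.where(np.isnan(sim))))
    rng.shuffle(gaps)                                  # random simulation path
    ti_cells = [(r, c) for r in range(rows) for c in range(cols)]
    for gr, gc in gaps:
        # Data event: values and lag vectors of the n closest informed pixels
        # (previously simulated pixels count as informed in later steps)
        informed = [(r, c) for r in range(rows) for c in range(cols)
                    if not np.isnan(sim[r, c])]
        informed.sort(key=lambda p: (p[0] - gr) ** 2 + (p[1] - gc) ** 2)
        event = [(r - gr, c - gc, sim[r, c]) for r, c in informed[:n_neigh]]
        rng.shuffle(ti_cells)                          # random scan of the TI
        for yr, yc in ti_cells:
            if all(0 <= yr + dr < rows and 0 <= yc + dc < cols
                   and training[yr + dr, yc + dc] == v
                   for dr, dc, v in event):
                sim[gr, gc] = training[yr, yc]         # assign Z(y) and stop
                break
    return sim
```

Because previously simulated pixels enter subsequent data events, a different random path (seed) yields a different but equally probable realization.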
The gap-filling procedure for a continuous variable (Figure 2) is slightly different from that for categorical variables, as it is unrealistic to find a replicate in the training image that is exactly the same as the defined data event. A distance criterion, with the notation d(Nx, Ny), is therefore introduced to choose the replicates used for filling gap pixels. The distance is a metric used to assess the similarity between the replicate Ny found in the training image and the data event Nx. When the distance is less than a given threshold t, Ny is regarded as similar to Nx and the value of y is assigned to x. The distance can be computed in different ways (further discussion on this can be found in Mariethoz et al. [31]).
Figure 2. Flowchart of the Direct Sampling method using a single training image. In the flowchart, x and y are the locations of the pixels in the target image and the training image, respectively. Z(x) and Z(y) are the values at the corresponding locations; d is the distance between two data events Nx and Ny, which can be denoted as d{Nx, Ny}; f is the maximum search proportion of the scanned training image and t is the user-defined threshold.

In this paper, the Landsat reflectance data is a continuous variable, hence the Euclidean distance is favored, expressed in Equation (2) as:

d(Nx, Ny) = (1/η) √( (1/n) Σᵢ₌₁ⁿ [Z(xᵢ) − Z(yᵢ)]² ) (2)

where η is a normalization factor, defined as the difference between the maximum and minimum values in the training image, which ensures that the distance remains bounded within the interval [0,1]. The threshold t is also defined in the same interval, of which t = 0 reflects the highest accuracy, allowing no discrepancy in the simulated patterns, and t = 1 assumes complete data event mismatch. Usually, it is impossible to achieve a threshold of 0, so a very small t value is set. While searching for replicates, the replicate giving the lowest distance is stored until one replicate that gives a distance lower than the threshold is found. To balance prediction accuracy and computational time, a fraction parameter f is introduced, defined as the maximum proportion of the training image to be searched for each target pixel. When the search proportion of the training image exceeds the fraction f without any replicate fulfilling the threshold requirement being found, the value of the pixel y with the lowest distance is assigned to Z(x). The concept of Direct Sampling is relatively simple, with only three main parameters, the number of neighbors n, the threshold t and the fraction f, to be considered in generating the simulation. Since Direct Sampling is a stochastic method, multiple equiprobable reconstructions can be generated to quantify uncertainty.
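The continuous-variable loop of the Figure 2 flowchart can be sketched for a single pixel as follows. This is a simplified illustration assuming a normalized root-mean-square distance of the Equation (2) form; the function name and defaults are our own, not the paper’s implementation. It accepts the first replicate whose distance falls below the threshold t, and falls back to the best candidate seen once the fraction f of the training image has been scanned.

```python
import numpy as np

def ds_fill_pixel(training, event, t=0.01, f=0.5, seed=0):
    """Sample one unknown pixel value from a continuous training image.

    event: list of (lag_row, lag_col, value) neighbors of the unknown pixel.
    t: distance threshold in [0, 1]; f: max fraction of the TI to scan.
    Accepts the first replicate with d(Nx, Ny) <= t, otherwise falls back
    to the best (lowest-distance) candidate encountered."""
    rng = np.random.default_rng(seed)
    eta = training.max() - training.min()          # normalization factor
    rows, cols = training.shape
    cells = [(r, c) for r in range(rows) for c in range(cols)]
    rng.shuffle(cells)                             # random scan of the TI
    max_scan = max(1, int(f * len(cells)))
    best_val, best_d = None, np.inf
    for yr, yc in cells[:max_scan]:
        diffs = []
        for dr, dc, v in event:
            rr, cc = yr + dr, yc + dc
            if not (0 <= rr < rows and 0 <= cc < cols):
                break                              # replicate leaves the TI
            diffs.append(training[rr, cc] - v)
        else:
            d = np.sqrt(np.mean(np.square(diffs))) / eta   # Equation (2)
            if d < best_d:
                best_val, best_d = training[yr, yc], d     # keep best so far
            if d <= t:                             # threshold met: stop early
                break
    return best_val
```

In a full simulation this sampling step is repeated along a random path over all gap pixels, with previously simulated values entering subsequent data events.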
In this study, the Direct Sampling fills missing pixels by deriving values from portions of the training image that lie within the specified distance threshold t of the data event, with the training image covering the same area as the target image (but from a different date). In cases where a significant portion of the target image is already known, the Direct Sampling can work without any additional training imagery, with the non-gap locations in the target image functioning as the training data to fill the gaps. In addition to these so-called univariate cases, the Direct Sampling can also be applied to multivariate simulations, with the process conceptually similar to the univariate simulation, the difference being in the selection of neighborhood pixels and the definition of the distance. Figure 3 illustrates the selection of neighborhood pixels in multivariate simulations (bivariate simulations are examined in this study). Figure 3a shows the target images, with the values of the observed pixels (i.e., colored pixels) differing between the two variables. Figure 3b displays the training images for the same two variables. The pixel with a red question mark in Figure 3a is the target pixel x to be predicted at the current step, and its closest seven pixels are selected as neighborhood pixels. Neighborhoods are defined on both variables. One replicate Ny found in the training image is shown in Figure 3b, with its pixels marked in red. The distance is computed by calculating a weighted average of the univariate distances of each variable, as shown in Equation (3):