electronics

Article Medical Video Coding Based on 2nd-Generation : Performance Evaluation

Merzak Ferroukhi 1, Abdeldjalil Ouahabi 2,3,* , Mokhtar Attari 1, Yassine Habchi 4 and Abdelmalik Taleb-Ahmed 5

1 Laboratory of Instrumentation, Faculty of Electronics and Computers, University of Sciences and Technology Houari Boumediene, 16111 Algiers, Algeria; [email protected] (M.F.); [email protected] (M.A.) 2 Polytech Tours, Imaging and Brain, INSERM U930, University of Tours, 37200 Tours, France 3 Computer Science Department, LIMPAF, University of Bouira, 10000 Bouira, Algeria 4 Electrical Engineering Department, LTIT, Bechar University, 08000 Bechar, Algeria; [email protected] 5 IEMN DOAE UMR CNRS 8520, UPHF, 59313 Valenciennes, France; [email protected] * Correspondence: [email protected]; Tel.: +33-603894463

 Received: 5 October 2018; Accepted: 8 January 2019; Published: 14 January 2019 

Abstract: The operations of digitization, transmission and storage of medical data, particularly images, require increasingly effective encoding methods not only in terms of compression ratio and flow of information but also in terms of visual quality. At first, there was DCT (discrete cosine transform) then DWT (discrete transform) and their associated standards in terms of coding and image compression. The 2nd-generation wavelets seeks to be positioned and confronted by the image and video coding methods currently used. It is in this context that we suggest a method combining bandelets and the SPIHT (set partitioning in hierarchical trees) algorithm. There are two main reasons for our approach: the first lies in the nature of the bandelet transform to take advantage of capturing the geometrical complexity of the image structure. The second reason is the suitability of encoding the bandelet coefficients by the SPIHT encoder. Quality measurements indicate that in some cases (for low bit rates) the performance of the proposed coding competes with the well-established ones (H.264 or MPEG4 AVC and H.265 or MPEG4 HEVC) and opens up new application prospects in the field of medical imaging.

Keywords: bandelets; medical imaging; quadtree decomposition; SPIHT coder; video coding; MPEG4 AVC; HEVC; video quality measure

1. Introduction and Motivation The extensive volume of patient medical data recorded at every moment in hospitals, medical imaging centers and other medical organizations has become a major issue. The need for quasi-infinite storage space and efficient real-time transmission in specific applications such as medical imaging, military imaging and satellite imaging requires advanced techniques and technologies including coding methods to reduce the amount of data to be stored or transmitted. Without coding of information, it is very difficult and sometimes impossible to make a way to store or communicate big data (large volumes of high velocity and complex images, audio and video information) via the internet. The encoding of these data is obtained by eliminating the redundant or unnecessary information in the original which cannot be identified with the naked eye. Telemedicine offers valuable specialty diagnostic services to underserved patients in rural areas, but often requires significant image compression to transmit medical images using limited bandwidth

Electronics 2019, 8, 88; doi:10.3390/electronics8010088 www.mdpi.com/journal/electronics Electronics 2019, 8, x FOR PEER REVIEW 2 of 18 bandwidth networks. The difficulty is that compressing images such as those for tele-echography can introduce artifacts and reduce medical image quality which affects directly the diagnosis [1]. Electronics 2019, 8, 88 2 of 18 This possible disadvantage is inherent in all lossy compression methods. Figure 1 recalls the key steps of video coding. Transformation is a linear and reversible operationnetworks. Thethat difficultyallows the is thatimage compressing to be represented images such in asthe those transform for tele-echography domain where can the introduce useful informationartifacts and is reduce well localized. medical image This property quality which will make affects it possible directly theduring diagnosis the compression [1]. to achieve goodThis discrimination, possible disadvantage that is to say is inherent the suppressi in all lossyon of compressionunnecessary methods.or redundant information. The discreteFigure cosine1 recalls transform the key stepsand the of videowavelet coding. tran Transformationsform are among is a linearthe most and reversiblepopular transforms operation proposedthat allows for the image image and to video be represented coding. in the transform domain where the useful information is wellQuantization localized. is This the step property of the willcompression make it possibleprocess during during which the compression a large part of to the achieve elimination good ofdiscrimination, unnecessary or that redundant is to say theinformation suppression occurs. of unnecessary or redundant information. The discrete cosineCoding transform techniques and the e.g., wavelet Huffman transform coding areand among arithmetic the mostcoding popular [2], are transforms the best-known proposed entropy for codingimage andparticularly video coding. in image and video coding.

Figure 1. SimplifiedSimplified video coding scheme.

ItQuantization is well known is the that step lossless of the compression compression methods process duringhave the which advantage a large of part providing of the elimination an image withoutof unnecessary any loss or redundantof quality informationand the disadvantage occurs. of compressing weakly. Hence, this type of compressionCoding techniquesis not suitable e.g., for Huffman transmission coding or and storage arithmetic of big coding data such [2], areas video. the best-known Conversely, entropy lossy compressioncoding particularly produces in image significantly and video smaller coding. file sizes often at the expense of image quality. However,It is well lossy known coding that based lossless on transforms compression has methods gained great have importance the advantage when of several providing applications an image withouthave appeared. any loss In of particular, quality and lossy the coding disadvantage represents of compressingan acceptable weakly. option Hence,for coding this medical type of images.compression e.g., a is DICOM not suitable (Digital for Imaging transmission Communica or storagetion ofin bigMedicine) data such JPEG as (Joint video. Photographic Conversely, Expertslossy compression Group) based produces on discrete significantly cosine smallertransform file (DCT) sizes oftenand a at DICOM the expense JPEG of 2000 image based quality. on wavelet.However, In lossy this context, coding basedwavelet-based on transforms image has codi gainedng, e.g., great JPEG2000, importance using when multiresolution several applications analysis [3,4]have exhibits appeared. performance In particular, that lossy is highly coding superior represents to other an acceptable methods optionsuch as for those coding based medical on DCT, images. e.g., JPEG.e.g., a DICOM (Digital Imaging Communication in Medicine) JPEG (Joint Photographic Experts Group) basedIn on 1993, discrete Shapiro cosine introduced transform a (DCT) lossy andimage a DICOM compression JPEG 2000 algorithm based onthat wavelet. he called In thisembedded context, wavelet-basedzerotrees of wavelet image coding,(EZW) e.g.,[5]. JPEG2000,The success using of co multiresolutionding wavelet analysisapproaches [3,4 ]is exhibits largely performance due to the occurrencethat is highly of superior effective to sub-band other methods coders. such The as thoseEZW basedencoder on DCT,is the e.g., first JPEG. to provide remarkable distortion-rateIn 1993, Shapiro performance introduced while aallowing lossy image progressive compression decoding algorithm of the that image. he calledThe principle embedded of zerotrees,zerotrees ofor waveletother partitioning (EZW) [5]. structures The success in sets of codingof zeros, wavelet makes approachesit possible to is take largely account due toof the residualoccurrence dependence of effective of sub-bandthe coefficients coders. between The EZW them. encoder More is precisely, the first tosince provide the high-energy remarkable coefficientsdistortion-rate are spatially performance grouped, while their allowing position progressive is effectively decoding comput ofed theby indicating image. The the principle position of thezerotrees, sets of orlow-energy other partitioning coefficients. structures After separation in sets of of zeros, the low-energy makes it possible coefficients to take and account the energetic of the coefficients,residual dependence the latter of is the relative coefficientsly independent between them.and can More be efficiently precisely, since coded the using high-energy quantification coefficients and entropyare spatially coding grouped, techniques. their position is effectively computed by indicating the position of the sets of low-energyIn 1996, coefficients. Said and Pearlman After separation proposed of a the hierarchical low-energy tree coefficients partitioning and coding the energetic scheme coefficients, called set partitioningthe latter is relativelyin hierarchical independent tree (SPIHT) and can[6], bebased efficiently on the codedsame basic using EZW, quantification and exhibiting and entropy better performancecoding techniques. than the original EZW. VideoIn 1996, compression Said and Pearlman algorithms proposed such as a H.264 hierarchical and MPEG tree partitioning use inter-frame coding prediction scheme to called reduce set videopartitioning data between in hierarchical a series of tree images. (SPIHT) This [6 involves], based techniques on the same such basic as differential EZW, and exhibitingcoding where better an performance than the original EZW. Video compression algorithms such as H.264 and MPEG use inter-frame prediction to reduce video data between a series of images. This involves techniques such as differential coding where an Electronics 2019, 8, 88 3 of 18 Electronics 2019, 8, x FOR PEER REVIEW 3 of 18 image is compared with a reference image and only the pixels that have changed with respect to that reference image are coded. MPEG (Moving Picture Experts Group) allows to encode video usingusing threethree codingcoding modes:modes: Intra coded framesframes (I):(I): framesframes areare coded coded separately separately without without reference reference to to previous previous images images (i.e., (i.e., as asJPEG JPEG coding). coding). Predictive coded frames (P): images are describeddescribed by difference fromfrom previousprevious images.images. Bidirectionally predictive coded frames (B): (B): the images are described by difference with the previous image and the following image. In order to optimize MPEG coding, the the image image sequences sequences are are in in practice practice coded coded in in a a series series of of I, I, B, B, and P images, the order of which has been determineddetermined experimentally. The typical sequence called GOP (group of pictures) is as follows (see Figure2 2):):

Figure 2.2. TypicalTypicalsequence sequence with with I, BI, andB and Pframes. P frames. P frame P frame can onlycan only refer refer to the to previous the previous I or P frames,I or P whileframes, a Bwhile frame a B can frame refer can to previousrefer to pr orevious subsequent or subsequent I or P frames. I or P frames.

To evaluateevaluate thethe visualvisual qualityquality of of the the reconstructed reconstructed video, video, several several measurements measurements of of the the quality quality of ofthe the MPEG MPEG video video are are validated validated in [in7, 8[7,8].]. As a successor to the famous H.264 or MPEG-4 Part 10 10 standards standards [9,10], [9,10], HEVC HEVC (High-Efficiency (High-Efficiency Video Coding)Coding) oror H.265 H.265 standard standard has has recently recently been be defineden defined as a promisingas a promising standard standard for video for coding, video coding,In June 2013In June [11 ],2013 the [11], first versionthe first of version HEVC of was HEVC announced, was announced, and to improve and to the improve efficiency the ofefficiency the coding of theof this coding standard, of this astandard, set of its a components set of its components requires morerequires development. more development. In addition, In addition, this new this system new systemrequires requires new hardware new hardware and software and software investments investments delaying delaying its adoption its adoption in applications in applications such as in such the asmedical in the field.medical field. In ourour case, case, the the use use of wavelets of wavelets enhances enhances the contours, the contours, which minimizes which minimizes the impact the of information impact of informationloss in medical loss imaging, in medical mainly imaging, in tumor mainly detection in tumor [12– 15detection]. Moreover, [12–15]. bandelets Moreover, are 2nd-generation bandelets are wavelets2nd-generation and the wavelets corresponding and the transform corresponding is an orthogonal transform multi-resolution is an orthogonal transform multi-resolution that is able to transformcapture the that geometric is able contentto capture ofimages the geometric and surfaces. content Hence, of images the possible and surfaces. presence Hence, of artifacts the possible will be presenceconsiderably of artifacts reduced. will be considerably reduced. Our motivation lies in the prospect of offering a new codec that is simple to implement with performances superior to the current H.264 standard and which could compete with the H.265 or HEVC standard for some applications. This This superiority superiority is is based based on on the SPIHT algorithm and the second-generation wavelets. One can ask: “Why using second generation wavelets (instead of classical wavelets)?” This new new generation generation of of wavelets wavelets makes makes it itpossib possiblele to to generate generate decorrelated decorrelated coefficients, coefficients, and and to eliminateto eliminate any any redundancy redundancy by by retaining retaining only only the the ne necessarycessary information information and and it it is is better better suited suited to geometric data (ridges,(ridges, contours,contours, curves,curves, oror singularities).singularities). ThisThis 2nd-generation2nd-generation wavelet wavelet consists of Xlets such asas shearlets,shearlets, bandelets,bandelets, curvelets, curvelets, contourlets, contourlets, ridgelets, ridgelets, noiselets, noiselets, etc. etc. These These Xlets Xlets provide provide an aninteresting interesting performance performance in some in applications, some applications, e.g., contourlets-based e.g., contourlets-based denoising [16], contourlets-baseddenoising [16], contourlets-based video coding [17,18], curvelets-based contrast enhancement [19], bandelets-based geometric image representations [20], shearlets-based image denoising [21], …

ElectronicsElectronics2019 2019, 8,, 8 88, x FOR PEER REVIEW 44 ofof 18 18

In this paper, we analyse and compare to the current state-of-the-art coding methods, the videoperformances coding [17 of,18 a], new curvelets-based coding method contrast based enhancement on bandelet [19 transform], bandelets-based where its geometric coefficients image are representationsencoded using [the20], set shearlets-based partitioning in image hierarchical denoising trees [21 (SPIHT).], . . . InThe this remainder paper, we of analysethis paper and is compare organized to as the follows: current Section state-of-the-art 2 presents coding and recalls methods, the main the performancesproperties of ofbandelet a new coding transform. method Section based 3 intr on bandeletoduces bandelet-SPIHT-bas transform where itsed coefficients video coding. are encoded Section 4 usingfocuses the on set the partitioning experimental in hierarchical aspect and treesthe analysis (SPIHT). of the results. It is worth noting that this work is an extensionThe remainder of the ofconference this paper paper is organized (IECON 2016 as follows:—42nd Annual Section 2Conference presents and of the recalls IEEE the Industrial main propertiesElectronics of bandeletSociety) transform.entitled “Towards Section3 introducesa New Standard bandelet-SPIHT-based in Medical Video video Compression” coding. Section [22].4 focusesCompared on the to experimental this conference aspect paper, and thethere analysis are subs of thetantial results. differences, It is worth in noting this thatrevised this workversion, is anincluding extension large of the databases conference used paper benchmarking (IECON 2016—42nd and the Annualintroduction Conference of HEVC of the which IEEE is Industrial the most Electronicsrecent standardized Society) entitled video“Towards compression a New technolo Standardgy. inFinally, Medical Section Video 5 Compression” concludes this [ 22study.]. Compared to this conference paper, there are substantial differences, in this revised version, including large databases2. Bandelet used Transform benchmarking Algorithm and the introduction of HEVC which is the most recent standardized video compression technology. Finally, Section5 concludes this study. This section describes the different steps of the bandelet approximation algorithm. This 2.algorithm Bandelet was Transform originally Algorithm proposed by Peyré and Mallat [23,24] in the context of geometric image representation. This section describes the different steps of the bandelet approximation algorithm. This algorithm was2.1. originally Bandelet Transform proposed by Peyré and Mallat [23,24] in the context of geometric image representation.

2.1. BandeletBandelets Transform are 2nd-generation wavelets and the corresponding transform is an orthogonal multi-resolution transform that is able to capture the geometric content of images and surfaces. Bandelets are 2nd-generation wavelets and the corresponding transform is an orthogonal Bandelet transform is carried out as follows: multi-resolution transform that is able to capture the geometric content of images and surfaces. First, using orthogonal or bi-orthogonal wavelet, we calculate the wavelet transform of the Bandelet transform is carried out as follows: original medical image. First, using orthogonal or bi-orthogonal wavelet, we calculate the wavelet transform of the Second, to construct a bandelet basis on the whole wavelet domain, we use a quadtree original medical image. segmentation at each wavelet scale in dyadic squares, as shown in Figure 3 where the main steps for Second, to construct a bandelet basis on the whole wavelet domain, we use a quadtree computing bandelet transform are summarized and illustrated on a brain magnetic resonance image segmentation at each wavelet scale in dyadic squares, as shown in Figure3 where the main steps for (MRI). Note that we define a different quadtree for each scale of the wavelet transform (in Figure 3 computing bandelet transform are summarized and illustrated on a brain magnetic resonance image only the quadtree of the finest scale is depicted). More specifically, we use the same quadtree for (MRI). Note that we define a different quadtree for each scale of the wavelet transform (in Figure3 each of the three orientations of the wavelet transform at fixed scale. only the quadtree of the finest scale is depicted). More specifically, we use the same quadtree for each of the three orientations of the wavelet transform at fixed scale.

Quadtree-A Quadtree- B

(a ) MRI (brain) Video (b ) Video frame (c ) Wavelet decomposition (d ) Extraction of a sub - square

(e ) Warped wavelet coefficients ( f ) Resulting 1D Wavelet transform (g ) Bandelet coefficients space

Figure 3. Example of graphical steps of the bandelet transformation algorithm illustrated on a brain Figure 3. Example of graphical steps of the bandelet transformation algorithm illustrated on a brain magnetic resonance image (MRI) (T2-weighted image acquired at 3 T). magnetic resonance image (MRI) (T2-weighted image acquired at 3 T). A complete representation in a bandelet basis is thus composed of: A complete representation in a bandelet basis is thus composed of:

Electronics 2019, 8, 88 5 of 18

Electronics 2019, 8, x FOR PEER REVIEW 5 of 18 • An image segmentation at each scale, defined by a quadtree, • • ForAn image each dyadic segmentation square of at the each segmentation, scale, defined by a quadtree, • For each dyadic square of the segmentation, – a polynomial flow, – a polynomial flow, – –bandelet bandelet coefficients. coefficients.

The MATLAB implementationimplementation ofof bandeletbandelet transformtransform isis freelyfreely availableavailable inin [[25].25].

2.2. Geometric Flows The bandelet bases are basedbased onon anan associationassociation betweenbetween thethe waveletwavelet decompositiondecomposition and an estimation ofof the the image image information information of geometric of geometri character.c character. Instead Instead of describing of describing the image the geometry image throughgeometry edges, through which edges, are mostwhich often are most ill defined, often ill the de imagefined, geometry the image is geometry characterized is characterized with a geometric with flowa geometric of vectors. flow Theseof vectors. vectors These give vectors the local give directions the local indirections which the in imagewhich hasthe regularimage has variations. regular Orthogonalvariations. Orthogonal bandelet bases bandelet are constructed bases are by constr dividingucted the by image dividing support the inimage regions support inside in which regions the geometricinside which flow the is geometric parallel. The flow estimation is parallel. of theThe geometry estimation is doneof the by geometry studying is the done contours by studying present the in ancontours image. present A contour in an is thenimage. seen A contour as a parametric is then curveseen as that a parametric will be characterized curve that bywill its be tangents. characterized To do this,by its we tangents. look for To gradients do this, ofwe significant look for grad importanceients of significant in the frame. importance in the frame. Around each region of frames,frames, the local geometry directions, in which the frame has regular variations inin thethe neighborhoodneighborhood ofof eacheach pixel, pixel, are are determined determined by by two-dimensional two-dimensional geometric geometric flow flow of of a vectora vector field. field.

2.3. Quadtree Division Process In this algorithm,algorithm, the bandelet transform is performedperformed over each dyadic square and scale in the waveletwavelet domain.domain. This is of course a redundantredundant transform.transform. The best segmentation represented as a quadtree isis obtainedobtained throughthrough LagrangianLagrangian optimization.optimization. An example of quadtree decompositiondecomposition is shown in Figure4 4..

Figure 4. Example of quadtree decomposition.

In orderorder toto minimizeminimize thethe distortiondistortion rate, we useuse a parallel,parallel, vertical or horizontal flowflow in eacheach dyadic square.square. TheThe macro macro block block is is considered considered regular regular uniformly uniformly and and wavelet wavelet basis basis is used is used if the if there the isthere no geometricis no geometric flow. Otherwise, flow. Otherwise, the sub-block the sub-bl mustock be processedmust be processed by warped by wavelets, warped aswavelets, explained as inexplained [23,24]. in [23,24]. The best quadtree requiresrequires aa minimization,minimization, forfor eacheach direction,direction, ofof thethe followingfollowing Lagrangian:Lagrangian:

2 Lf,R=f-f() 2 +λQR+R+R2 2 ()  L( fd, R) =dk fd − dfdR dRk + λQ∑ jSRjS + jGRjG jB+ RjB (1)(1) j j where where

f d : Original signal. fd: Original signal. f dR : Recovered signal using inverse 1D wavelet transform. fdR: Recovered signal using inverse 1D wavelet transform. R jS : Number of bits needed to encode the dyadic segmentation. RjS: Number of bits needed to encode the dyadic segmentation. R jG : Number of bits needed to encode the geometric parameter.

R jB : Number of bits needed to encode the quantized bandelets coefficients. λ: Lagrangian multiplier is chosen heuristically to be equal to ¾. To justify this value, see [20].

Electronics 2019, 8, 88 6 of 18

RjG: Number of bits needed to encode the geometric parameter.

RjB: Number of bits needed to encode the quantized bandelets coefficients. Electronics 2019, 8, x FOR PEER REVIEW 6 of 18 3 λ: Lagrangian multiplier is chosen heuristically to be equal to 4 . To justify this value, see [20]. Q: QuantizationQ : Quantization step. step. TheThe following following steps steps implement implemen thet the quadtree quadtree structure.structure. To computeTo compute the quadtree,the quadtree, we firstwe first compute compute the Lagrangianthe Lagrangian for eachfor each dyadic dyadic square square for the for smallest the size.smallest Then, the size. algorithm Then, the goesalgorithm from goes the smallestfrom the sizesmal tolest the size biggest, to the biggest, and tries and to tries merge to merge each groupeach of fourgroup squares. of four To do squares. so, it simply To do computesso, it simply the computes Lagrangian the ofLagrangian the merged of the square, merged and square, compares and it to compares it to the sum of the 4 Lagrangians (plus 1 bit, which is the cost of coding a split in the tree). the sum of the 4 Lagrangians (plus 1 bit, which is the cost of coding a split in the tree). Using MATLAB code, one can compute a single quadtree for each scale and for each Using MATLAB code, one can compute a single quadtree for each scale and for each orientation. orientation. Sharing the same quadtree for the three orientations leads to a little efficiency increase, Sharingat the the price same of quadtreea more complex for the implementation. three orientations The leads quadtree to a structure little efficiency and geometry increase, are at stored the price in of a morean image complex that implementation.is the same size as the The original quadtree imag structuree (and they and have geometry the same are hierarchical stored in structure an image as that is thethe same wavelet size transformed as the original image). image (and they have the same hierarchical structure as the wavelet transformed image). 2.4. Warped Wavelet along Geometric Flows 2.4. Warped Wavelet along Geometric Flows In order to perform the complex geometry and remove the redundancy of orthogonal wavelet coefficients,In order to bandelet perform basis the complexdecomposition geometry is applied and remove with a fast the application redundancy of ofthe orthogonal geometric flow, wavelet coefficients,quadtree bandeletdecomposition, basis warping decomposition and bandeletization. is applied with a fast application of the geometric flow, quadtreeIn decomposition, each dyadic square warping and at and each bandeletization. scale, the geometric flow is used to warp the wavelet basis alongIn each the dyadicregularity square direction. and atAfte eachr warping, scale, the the geometricnext step is flow to construct is used bandelets to warp theby applying wavelet basisa alongbandeletization the regularity procedure. direction. After warping, the next step is to construct bandelets by applying a The wavelet includes high-pass filters and vanishing moments at lower resolutions, this is valid bandeletization procedure. for vertical and diagonal detail coefficients, but not for horizontal detail coefficients. The problem of The wavelet includes high-pass filters and vanishing moments at lower resolutions, this is valid regularity along the geometric flow is due to the scaling function where it includes low-pass filters for verticaland does and not diagonalhave a vanishing detail coefficients, moment at lower but not resolutions. for horizontal detail coefficients. The problem of regularityTo along take advantage the geometric of regularity flow is due along to thethe scalinggeometric function flow for where horizontal it includes detail low-pass coefficients, filters the and doeswarped not have wavelet a vanishing basis is moment bandeletized at lower by replacin resolutions.g the horizontal wavelet by modified horizontal waveletTo take advantagefunction. Near of regularity the singularities, along the geometricwavelet coefficient flow for horizontalcorrelations detail are coefficients,removed by the using warped waveletbandeletization basis is bandeletized operation. by replacing the horizontal wavelet by modified horizontal wavelet function. Near theAfter singularities, warping waveletand bandeletization coefficient correlations operations, are the removed regions byare usingregular bandeletization along the vertical operation. or horizontalAfter warping direction. and In bandeletizationthe end, warped wavelets operations, are used the regionsto compute are bandelet regular coefficients along the with vertical 1D or horizontaldiscrete direction. wavelet transform In the end, than warped are encoded wavelets using are SPIHT used coder. to compute bandelet coefficients with 1D discrete waveletFigure 5 transformillustrates thanthis feature are encoded and compares using SPIHT histograms: coder. wavelet coefficients vs bandelet coefficients. Figure 5 clearly shows that the bandelet coefficients have larger amplitudes than the Figure5 illustrates this feature and compares histograms: wavelet coefficients vs bandelet wavelet coefficients. This gives the bandelets the ability to carry almost only useful information since coefficients. Figure5 clearly shows that the bandelet coefficients have larger amplitudes than the what is considered undesirable information is represented by small amplitude coefficients rarely waveletpresents coefficients. in the histogram This gives of theband bandeletselet coefficients. the ability Bandelets to carry are, almost therefore, only a usefulnatural information candidate for since whateffective is considered image compression. undesirable information is represented by small amplitude coefficients rarely presentsHence, in the histogramthe SPIHT coder, of bandelet initially coefficients. applied to wa Bandeletsvelet coefficients, are, therefore, will become a natural bandelet-SPIHT candidate for effectiveand will image be applied compression. to bandelet coefficients: this is what justifies our contribution.

1800 4000

1600 3500

1400 3000

1200 2500

1000

2000

800

1500 600

1000 400

200 500 Bandelet coefficient number Wavelet coefficients number coefficients Wavelet 0 0 -300 -200 -100 0 100 200 300 -300 -200 -100 0 100 200 300 Wavelet coefficients values Bandelet coefficient values (a) Wavelet (b) Bandelet

FigureFigure 5. Histograms 5. Histograms of of bandelet bandelet vs. vs. wavelet wavelet coefficients.coefficients. Bandelet Bandelet transform transform contains contains only only the themost most significantsignificant coefficients coefficients and, and, therefore, therefore, those those which which carrycarry information.

Electronics 2019, 8, 88 7 of 18

Hence, the SPIHT coder, initially applied to wavelet coefficients, will become bandelet-SPIHT and will be applied to bandelet coefficients: this is what justifies our contribution.

3. Process of Encoding

3.1. Set Partitioning in Hierarchical Trees (SPIHT) Algorithm Principles The SPIHT algorithm is one of the most efficient encoder [6]. It is used in many applications such as progressive web browsing, flexible image storage and retrieval, and image transmission over heterogeneous networks [26,27]. The SPIHT algorithm takes up the principles of EZW (embedded zerotree wavelet coding) [5] while proposing to recursively partition the coefficient trees. Set partitioning in hierarchical trees (SPIHT) is an image-compression algorithm that exploits the inherent similarities across the subbands in a wavelet decomposition of an image. This algorithm codes the most important wavelet transform coefficients first, and transmits the bits so that an increasingly refined copy of the original image can be obtained progressively. This method represents an important advance in the field and deserves special attention because it provides the following:

- Highest image quality; - Progressive image transmission; - Fully embedded coded file; - Simple quantization algorithm; - Fast coding/decoding; - Completely adaptive; - Error protection.

For more details on the SPIHT algorithm refer to the famous founding paper written by Said et al. [6]. SPIHT MATLAB code is available for both gray scale and true color images; it requires Image Processing Toolbox and Wavelet Toolbox. Thanks to these features, we have chosen this algorithm as the basis for our contribution, called the bandelet-SPIHT algorithm.

3.2. New Coding Algorithm: Bandelet-SPIHT The proposed coding method is formulated as follows:

• Step 1: Input: Medical sequences of size 512 × 512; • Step 2: Performing 2D discrete wavelet transform to decompose sequences into a subband; • Step 3: Decompose the frame recursively into dyadic squares S; • Step 4: Geometric flow is applied to the each square; • Step 5: Warping the discrete wavelets along the geometric flux; • Step 6: Perform 1D discrete wavelet transform to bandelize warped wavelet bases; • Step 7: Resulting bandelet coefficients are encoded using SPIHT encoder; • Step 8: Quality of the recovered video sequences is evaluated according to both objective and subjective criteria.

4. Experimental Results and Discussion We proposed in this work, a new algorithm for medical video coding based on bandelet transform coupled by SPIHT coder that is able to capture complex geometric contents of images and surfaces and reduce drastically existing redundancy in video. Electronics 2019, 8, 88 8 of 18

The accuracy of the algorithm is evaluated vs. bit-rate range 0.1 to 0.5 Mbps on sized medical video sequences. Each sequence has variable number of frames, and the frame rate is equal to 30 frames/second. Figure6 shows a sample of medical frames used for evaluation:

- MR imaging is the modality of choice for accurate local staging of bladder cancer (see Figure6. (d)). In addition, bladder MR imaging helps detect lymph node involvement. With bladder Electronics 2019, 8, x FOR PEER REVIEW 8 of 18 cancer (T2-weighted axial image), MRIs are used mainly to look for any signs that cancer has spread beyond the bladder into nearby tissues or lymph nodes. cancer (T2-weighted axial image), MRIs are used mainly to look for any signs that cancer has - Cardiac MRI is used to detect or monitor cardiac disease and to evaluate the heart’s anatomy—for spread beyond the bladder into nearby tissues or lymph nodes. - example,Cardiac inMRI the is evaluation used to ofdetect congenital or monitor heart disease.cardiac disease The MR and image to evaluate of the heart, the shownheart’s in Figureanatomy—for3c, is derived example, from in gradient the evaluation echo imaging of congenital employed heart in disease. the assessment The MR image of ventricular of the function,heart, shown blood in velocity Figure 3c, and is flowderived measurements, from gradient assessment echo imaging of valvular employed disease, in the assessment and myocardial of perfusion.ventricular This function, MR image blood is performedvelocity and with flow the me useasurements, of intravenous assessment contrast of agentsvalvular (gadolinium disease, chelates)and myocardial to enhance perfusion. the signal This ofMR pathology image is performed or to better with visualize the use the of bloodintravenous pool orcontrast vessels. Theagents paramagnetic (gadolinium effect chelates) of gadolinium to enhance causes the signal a shortening of pathology of T1or relaxationto better visualize time, thus the blood causing areaspool withor vessels. gadolinium The paramagnetic to be bright effect on of T1-weighted gadolinium causes images. a shortening Contrast of agent T1 relaxation is used in cardiactime, thus MRI causing for evaluation areas with of gadolinium myocardial to perfusion, be bright on delayed T1-weighted enhanced images. imaging Contrast (myocardial agent is infarctions,used in cardiac myocarditis, MRI for infiltrative evaluation processes), of myoc differentiationardial perfusion, of intracardiacdelayed enhanced masses (neoplasmimaging vs.(myocardial thrombus) etc.infarctions, myocarditis, infiltrative processes), differentiation of intracardiac - Computedmasses (neoplasm tomography vs. thrombus) (CT) of theetc. abdomen and pelvis, shown in Figure3b, is a diagnostic - imagingComputed test tomography used to help (CT) detect of the diseases abdomen of the and small pelvis, bowel, shown colon in Figure and other 3b, is internal a diagnostic organs andimaging is often test used used to to determine help detect the diseases cause of of unexplained the small bowel, pain. colon In emergency and other cases, internal it can organs reveal and is often used to determine the cause of unexplained pain. In emergency cases, it can reveal internal injuries and bleeding quickly enough to help save lives. internal injuries and bleeding quickly enough to help save lives. - Figure3a shows a coronary angiogram showing the left coronary circulation. The exam by - Figure 3a shows a coronary angiogram showing the left coronary circulation. The exam by coronarographycoronarography consists consists in in aa filmedfilmed radiography,radiography, typically typically at at 30 30 frames frames per per second, second, of ofcoronary coronary (arteries(arteries using using X-rays X-rays andand aa tracertracer product product based based on on iodine). iodine). The The procedure procedure involves involves inserting inserting a acatheter (thin (thin flexible flexible tube) tube) into into a blood a blood vessel vessel in the in groin the groinor arm or area arm toarea guide to it guide to the itheart. to the heart.Once Once the catheter the catheter is in is place, in place, contrast contrast material material is injected is injected into into thethe coronary coronary arteries arteries so sothat that physiciansphysicians can can see see blocked blocked or or narrowed narrowed sections sections of of arteries. arteries.The The testtest alsoalso verifiesverifies thethe conditioncondition of theof valvesthe valves and and heart heart muscle. muscle.

FigureFigure 6. Medical6. Medical video video used used for assessment.for assessment. (a) Coronary (a) Coronary angiography-X-ray; angiography-X-Ray; (b) abdomen/pelvis- (b) abdomen computed/pelvis-computed tomography tomography (CT); (c) (CT); heart-axial (c) heart-axial MRI; (d )MRI; bladder-MRI. (d) bladder-MRI.

TheThe study study is is based based on on real real data data from from INSERMINSERM (French(French National National Institute Institute of of Health Health and and Medical Medical Research),Research), including including large large databases databases usedused forfor thethe comparative analysis. analysis. The The objective objective is isto toshow show the the feasibilityfeasibility of of the the proposed proposed codec. codec. The The imagesimages shownshown are a a simple simple illustration illustration and and their their technical technical and and medicalmedical characteristics characteristics are are summarized summarized in in TableTable1 .1. The results presented in this paper are based on a statistical study (Mean, 95% confidence interval, mean squared error/peak signal-to-noise ratio (MSE/PSNR)). Each value represents an average over a large sample of sequences (see Table 1).

Table 1. Details and characteristics of all standards tested video.

Number of Number of Width × Heigh Anatomical Region/Modality Frames Frames/Second Video Format Brain-MRI (T2-weighted image acquired at 3 T) 144 30 240 × 240 AVI Bladder-MRI (T2-weighted axial image) 662 30 480 × 320 AVI Heart-MRI (Gradient echo imaging) 87 30 340 × 336 AVI Abdomen/pelvis-CT 295 30 340 × 352 AVI Coronary angiogram-X-Ray (+a tracer product based on iodine) 594 30 360 × 360 AVI

Electronics 2019, 8, 88 9 of 18

Table 1. Details and characteristics of all standards tested video.

Number of Number of Width × Heigh Anatomical Region/Modality Frames Frames/Second Video Format

Brain-MRI (T2-weighted image acquired at 3 T) 144 30 240 × 240 AVI Bladder-MRI (T2-weighted axial image) 662 30 480 × 320 AVI Heart-MRI (Gradient echo imaging) 87 30 340 × 336 AVI Abdomen/pelvis-CT 295 30 340 × 352 AVI Coronary angiogram-X-ray (+a tracer product 594 30 360 × 360 AVI based on iodine)

The results presented in this paper are based on a statistical study (Mean, 95% confidence interval, mean squared error/peak signal-to-noise ratio (MSE/PSNR)). Each value represents an average over a large sample of sequences (see Table1). The perceived quality of recovered frames has been measured. Moreover, a comparison between Bandelet-SPIHT versus wavelet transform using SPIHT encoder, MPEG-4, and H.26x is carried. To evaluate the quality of the compressed image, subjective and objective quality measures are used. But subjective quality measures based upon group of observers are too time consuming [28–31]. So, objective quality measures such as MSE, PSNR etc. are preferred.

4.1. Objective Measurement of Video Quality For the analysis of decoded video, we can distinguish data metrics, which measure the fidelity of the signal without considering its content, and picture metrics, which treat the video data as the visual information that it contains. For compressed video delivery over packet networks, there are also packet- or bitstream-based metrics, which look at the packet header information and the encoded bitstream directly without fully decoding the video. Furthermore, metrics can be classified into full-reference, no-reference and reduced-reference metrics based on the amount of reference information they require.

4.1.1. Data Metrics The image and video processing community has long been using MSE and PSNR as fidelity metrics (mathematically, PSNR is just a logarithmic representation of MSE). There are a number of reasons for the popularity of these two metrics. The formulas for computing them are as simple to understand and implement as they are easy and fast to compute. Minimizing MSE is also very well understood from a mathematical point of view. Over the years, video researchers have developed a familiarity with PSNR that allows them to interpret the values immediately. There is probably no other metric as widely recognized as PSNR, which is also due to the lack of alternative standards. Despite its popularity, PSNR only has an approximate relationship with the video quality perceived by human observers, simply because it is based on a byte-by-byte comparison of the data without considering what they actually represent. PSNR is completely ignorant to things as basic as pixels and their spatial relationship, or things as complex as the interpretation of images and image differences by the human visual system. PSNR is the ratio between the maximum possible power of a signal and the power of noise. PSNR is usually expressed in terms of the logarithmic decibel

 (2n − 1)  PSNR = 10 log (2) 10 MSE where

• (2n − 1) is the dynamic of the signal. In the standard case, (2n − 1) = 255 is maximum possible amplitude for an 8-bit image. Electronics 2019, 8, 88 10 of 18

• MSE represents the mean square error between two frames, namely the original frame f (i, j) and the recovered frame fr(i, j) of size M × N and is given by:

M N 1 2 MSE = ∑ ∑( f (i, j) − fr(i, j)) (3) MN i=1 j=1

MSE can be identified as the power of noise.

The PSNR is an easy, fast and very popular quality measurement metric, widely used to compare the quality of video encoding and decoding. Although a high PSNR generally means a good quality reconstruction, this is not always the case. Indeed, PSNR requires original image for comparison, but this may not be available in every case, also PSNR does not correlate well with subjective video quality measures, so is not very suitable for perceived visual quality.

4.1.2. Picture Metrics As an alternative to data metrics described above, better visual quality measures have been designed taking into account the effects of distortions on perceived quality. An engineering approach consists primarily on the extraction and analysis of certain features or artifacts in the video. These can be either structural image elements such as contours, or specific distortions that are introduced by a particular video processing step, compression technology or transmission link, such as block artifacts. This does not necessarily mean that such metrics disregard human vision, as they often consider psychophysical effects as well, but image content and distortion analysis rather than fundamental vision modeling is the conceptual basis for their design. 1. Image Quality Assessment using Structural Similarity (SSIM) index Wang et al.’s structural similarity (SSIM) index [32] computes the mean, variance and covariance of small patches inside a frame and combines the measurements into a distortion map. Two image signals x and y are aligned with each other (e.g., spatial patches extracted from each image). If we consider one of the signals to have perfect quality, then the similarity measure can serve as a quantitative measurement of the quality of the second signal. SSIM index introduces three key features: luminance l, contrast c and structure s. This metrics is defined in (4): SSIM( f , fr) = l( f , fr) × c( f , fr) × s( f , fr) (4)

- The luminance comparison l(x, y) is defined as a function of the mean intensities Mx and My of signals x et y: 2M M + C ( ) = x y 1 l x, y 2 2 (5) Mx + My + C1 where 1 N 1 N Mx = ∑ xi and My = ∑ yi N i=1 N i=1 2 2 and the constant C1 is included to avoid instability when Mx + My is close to zero (similar considerations for the presence of C2 and C3). - The contrast comparison c(x, y) is function of standard deviations σx and σy, and takes the following form: 2σ σ + C ( ) = x y 2 c x, y 2 2 (6) σx + σy + C2 where the standard deviation of x is Electronics 2019, 8, 88 11 of 18

!1/2 1 N σ = (x2 − (M )2) x − ∑ i x N 1 i=1

- A simple and effective measure to quantify the structural similarity s(x, y) is the estimation of the correlation coefficient σxy between x and y. The structure comparison s(x, y) is defined as follows:

σxy + C3 s(x, y) = (7) σxσy + C3

where the covariance of x and y is

1 N σ = x y − M M xy − ∑ i i x y N 1 i=1

2 2 Specifically, it is possible to choose Ci = Ki D , i = 1, and Ki is a constant, such as Ki  1 and D is the dynamic range of the pixel values (D = 255 corresponds to a grey-scale digital image when C2 the number of bits/pixel is 8). For C3 = 2 , the explicit expression of the structural similarity (SSIM) index is:   2Mx My + C1 2σxy + C2 SSIM(x, y) =    (8) 2 2 2 2 Mx + My + C1 σx + σy + C2

SSIM is symmetrical and less than or equal to one (it is equal to one if and only if f = fr, i.e., if x = y in expression (8)). Generally, over the whole video coding, a mean value of SSIM is required as mean SSIM (MSSIM):

1 L MSSIM( f , fr) = ∑ SSIM( fi, fri) (9) L i=1 where fi and fri are the contents of frames (original and recovered respectively) at the ith local window (or sub-image), and L is the total of local windows number in frame. The MSSIM values exhibit greater consistency with the visual quality. 2. Image Quality Assessment using Video Fidelity (VIF) index Sheikh and Bovik [33] proposed a new paradigm for video quality assessment: visual information fidelity (VIF). This criterion quantifies the Shannon information that is shared between the original and recovered images relative to the contained information in the original image itself. It uses natural scene statistics modelling in conjunction with an image-degradation model and a human visual system (HVS) model. Visual information fidelity uses the Gaussian scale mixture model in the wavelet domain. To obtain VIF one performs a scale-space-orientation wavelet decomposition using the steerable pyramid and models each sub-band in the source as C = SU, where S is a random field of scalars and U is a Gaussian vector. The distortion model is GC + n where G is a scalar gain field and n is an additive Gaussian noise. VIF then assumes that the distorted and source images pass through the human visual system and the HVS uncertainty is modelled as visual noise N. The model is then: Reference signal E = C + N (10)

Test signal F = D + N (11) Electronics 2019, 8, 88 12 of 18 where E and F denote the visual signal at the output of the HVS model from the reference and the test videosElectronics respectively 2019, 8, x FOR (see PEER Figure REVIEW7), from which the brain extracts cognitive information. 12 of 18

Figure 7. Diagram of the visual information fidelityfidelity (VIF) metric.

VIF measure takes values between 00 andand 1,1, wherewhere 11 meansmeans perfectperfect quality.quality. Thus the visual information fidelityfidelity (VIF)(VIF) measuremeasure isis givengiven by:by:

j j j ∑j I C ; F /s VIF = (12) ICj; Ej/ sj ∑ 𝐼𝐶∑j ;𝐹 /𝑠 (12) VIF = where I(X; Y/z) is the conditional mutual information∑ 𝐼(𝐶 ;𝐸 between/𝑠 ) X and Y given z, C denotes the random j j fieldwhere from 𝐼(𝑋; a channel⁄) 𝑌 𝑧 is the in the conditional original image, mutuals isinformation a realization between of S for 𝑋 a particularand 𝑌 given image 𝑧, andC denotes the index thej runsrandom through field allfrom the a sub channel bands in in the the original decomposed image, image. 𝑠 is a realization of 𝑆 for a particular image and the index j runs through all the sub bands in the decomposed image. 4.2. Subjective Quality Assessment

4.2. SubjectiveThe reference Quality for Assessment multimedia quality are subjective experiments, which represent the most accurate method for obtaining quality ratings. In subjective experiments, a number of “subjects”, The reference for multimedia quality are subjective experiments, which represent the most clinicians and PhD students in our study case, (typically 15–30) are asked to watch a set of medical accurate method for obtaining quality ratings. In subjective experiments, a number of “subjects”, video and rate their quality. The average rating over all viewers for a given database is also known as clinicians and PhD students in our study case, (typically 15–30) are asked to watch a set of medical the mean opinion score (MOS) [28]. Since each individual has different interests and expectations for video and rate their quality. The average rating over all viewers for a given database is also known video, the subjectivity and variability of the viewer ratings cannot be completely eliminated. as the mean opinion score (MOS) [28]. Since each individual has different interests and expectations Subjective experiments are invaluable tools for assessing multimedia quality. Their main for video, the subjectivity and variability of the viewer ratings cannot be completely eliminated. shortcoming is the requirement for a large number of viewers, which limits the amount of video Subjective experiments are invaluable tools for assessing multimedia quality. Their main material that can be rated in a reasonable amount of time; they are neither intended nor practical for shortcoming is the requirement for a large number of viewers, which limits the amount of video 24/7 in-service monitoring applications. Nonetheless, subjective experiments remain the benchmark material that can be rated in a reasonable amount of time; they are neither intended nor practical for for any objective quality metric. 24/7 in-service monitoring applications. Nonetheless, subjective experiments remain the benchmark 4.3.for any Choice objective of Optimal quality Wavelet metric. Filters

4.3. ChoiceIn wavelet-based of Optimal Wavelet image Filters coding, a variety of orthogonal and biorthogonal filters have been developed for compression. The selection of wavelet filters plays a crucial part in achieving an effective codingIn performance,wavelet-based because image therecoding, is no a filtervariety that of performs orthogonal the bestand forbiorthogonal all images. filters have been developedOur aim for here compression. is to suggest The the selection most suitable of wavelet wavelet filters filter plays for different a crucial test part images in achieving based upon an theeffective quality coding measures performance, introduced because above. there is no filter that performs the best for all images. ToOur achieve aim here a high is to compressionsuggest the most rate, suitable it is often wavelet necessary filter to for choose different the besttest images wavelet based filter upon bank (seethe quality Mallat algorithmmeasures introduced [3,4]) and decomposition above. level, which will play a crucial role in compressing the images.To achieve The choice a high of optimal compression wavelets rate, has it severalis often criteria.necessary The to main choose criteria the best are: wavelet filter bank (see Mallat algorithm [3,4]) and decomposition level, which will play a crucial role in compressing •the images.Compact The support choice size.of optimal wavelets has several criteria. The main criteria are: • Regularity and degree of smoothness. • Compact support size. • • Symmetry.Regularity and degree of smoothness. • OrthogonalitySymmetry. and biorthogonality of the resulting analysis. • Orthogonality and biorthogonality of the resulting analysis. Compact support size lead to efficient implementation, regularity, and degree of smoothness (related to filter order or can filter length), symmetry is useful in avoiding dephasing in image processing, and orthogonality allows fast algorithm [3].

Electronics 20192019,, 88,, 88x FOR PEER REVIEW 1313 of of 18

In this experiment, MRI video is encoded using the Bandelet-SPIHT algorithm for various bit rates.Compact In order supportto find the size most lead suitable to efficient wavelet implementation, filter for different regularity, test images and degree based upon of smoothness objective (relatedquality measures to filter orderfor video or can compre filterssion, length), biorthogonal symmetry family is useful was in considered. avoiding dephasing Thus, biorthogonal in image Electronicsprocessing,wavelets 2019 CDF and, 8, x(Cohen–Daubechies–Feauveau) FOR orthogonality PEER REVIEW allows fast algorithm 9/7 (nine [3]. coefficients in decomposition low-pass13 of and 18 sevenIn in this decomposition experiment, highpass MRI video filters) is encoded and Le usingGall 5/3 the (five Bandelet-SPIHT coefficients in algorithm decomposition for various low-pass bit rates.and threeIn In this order in experiment, decomposition to find the MRI most high video suitable-pass is encodedfilters) wavelet are usin filterselected.g the for different Bandelet-SPIHT test images algorithm based uponfor various objective bit rates.qualityThe In measuresorder choice to of find for these videothe filters most compression, suitableis justified wavelet biorthogonalby th eirfilter simplicity, for family different symmetry was test considered. images and their based Thus, compact upon biorthogonal objective support qualitywavelets(localization measures CDF in (Cohen–Daubechies–Feauveau) space). for video compression, biorthogonal 9/7 (nine family coefficients was considered. in decomposition Thus, low-passbiorthogonal and waveletssevenBy in a decompositionCDF subjective (Cohen–Daubechies–Feauveau) quality highpass assessment filters) andof images Le 9/7 Gall (nine shown 5/3 coefficients (five in coefficientsFigure in 8, decomposition we in decompositionopt for CDF low-pass 9/7, low-pass which and sevenandreduces three in decompositionthe in level decomposition of artifacts. highpass high-pass Indeed, filters) filters)the and obtained Le are Gall selected. results 5/3 (five show coefficients that the inrecovered decomposition frame usinglow-pass the andbandelet threeThe choicealgorithmin decomposition of these + SPIHT filters high is close is-pass justified to filters) the byoriginal are their selected. simplicity,frame. symmetry and their compact support (localizationTheIt is choiceworth in space).notingof these the filters presence is justified of blurred by th areaeir simplicity,s in the image symmetry when Le an Galld their 5/3 compactfilters are support used. (localizationByObjective a subjective in space).measurement quality assessmentof video quality of images (PSNR, shown SSIM in and Figure VIF)8, webetween opt for (bandelet CDF 9/7, + SPIHT) which reducesalgorithmBy a the subjectiveand level (wavelet of artifacts.quality + SPIHT) assessment Indeed, applied the of to obtained images coronary resultsshown angiography showin Figure that test 8, the framewe recovered opt at forvarious CDF frame bit9/7, using rates which theare reducesbandeletillustrated the algorithm in level Figure of + artifacts.9. SPIHT isIndeed, close to the the obtained original frame.results show that the recovered frame using the bandelet algorithm + SPIHT is close to the original frame. It is worth noting the presence of blurred areas in the image when Le Gall 5/3 filters are used. Objective measurement of video quality (PSNR, SSIM and VIF) between (bandelet + SPIHT) algorithm and (wavelet + SPIHT) applied to coronary angiography test frame at various bit rates are illustrated in Figure 9.

(a) Original (b) Le Gall 5/3 filter (c) CDF9/7 filter

Figure 8. Recovered frames using:using: ((bb)) BandeletBandelet (Le(Le GallGall 5/3)-SPIHT5/3)-SPIHT and (c) Bandelet (CDF9/7)-SPIHT(CDF9/7)-SPIHT at 0.2 Mbps.

It40 is worth noting the presence of blurred1 areas in the image when0.8 Le Gall 5/3 filters are used. Objective(a) Original measurement of video(b) Le qualityGall 5/3 filter (PSNR, SSIM and VIF)0.7 between(c) CDF9/7 (bandelet filter + SPIHT) 35 0.9 algorithmFigure and8. Recovered (wavelet frames + SPIHT) using: applied (b) Bandelet to coronary (Le Gall angiography5/3)-SPIHT and0.6 test (c) frameBandelet at (CDF9/7)-SPIHT various bit rates are 30 0.8 illustratedat 0.2 Mbps. in Figure 9. 0.5

25 0.7 0.4 VIF 40 1 0.8 MSSIM PSNR(dB) 0.3 20 0.6 0.7 35 0.9 0.2 0.6 Wavelet-SPIHT Wavelet-SPIHT 15 0.5 Wavelet-SPIHT 30 Bandelet-SPIHT 0.8 Bandelet-SPIHT 0.1 0.5 Bandelet-SPIHT

10 0.4 0 250.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.70.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.40.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 BITRATE(Mbps) BITRATE(Mbps) VIF BITRATE(Mbps) MSSIM

PSNR(dB) 0.3 Figure20 9. Performance evaluation of0.6 bandelet-SPIHT versus (wavelet) SPIHT in terms of objective 0.2 metrics (peak signal-to-noiseWavelet-SPIHT ratio (PSNR), mean structural Wavelet-SPIHTsimilarity (MSSIM) and VIF). 15 0.5 Wavelet-SPIHT Bandelet-SPIHT Bandelet-SPIHT 0.1 Bandelet-SPIHT

10 0.4 0 Figure0.1 0.15 100.2 is0.25 an0.3 illustration0.35 0.4 0.45 of0.5 a sample0.1 0.15 0.2 of0.25 medical0.3 0.35 0.4images0.45 0.5 from0.1 0.15our0.2 large0.25 databases0.3 0.35 0.4 0.45used0.5 for benchmarking (subjectiveBITRATE(Mbps) quality assessment by BITRATE(Mbps)visual inspection). BITRATE(Mbps) FigureFigure 9. PerformancePerformance evaluation evaluation of of bandelet-SPIHT bandelet-SPIHT ve versusrsus (wavelet) (wavelet) SPIHT SPIHT in in terms terms of of objective metricsmetricsThe images (peak (peak signal-to-noise signal-to-noiseshown in Figure ratio ratio (PSNR), (PSNR),10 are me meana ansimple structural structural illustration similarity similarity and(MSSIM) (MSSIM) their and and technical VIF). VIF). and medical characteristics are summarized in Table 1 (see also Figure 6). They represent: (a) Coronary angiography-X-Ray;FigureFigure 1010 isis an illustration(b) abdomen of of a a /pelvis-computed sample of of medical tomography images images from from (CT); our our large large(c) heart-axial databases MRI;used used for for(d) benchmarkingbenchmarkingbladder-MRI. (subjective (subjective quality assessment by visual inspection). The images shown in Figure 10 are a simple illustration and their technical and medical characteristicsThe images are shown summarized in Figure in Table 101 are(see a also simple Figure illustration6). They represent: and their ( a )technical Coronary and angiography- medical characteristicsX-ray; (b) abdomen/pelvis-computed are summarized in Table tomography 1 (see (CT);also (cFigure) heart-axial 6). They MRI; represent: (d) bladder-MRI. (a) Coronary angiography-X-Ray; (b) abdomen /pelvis-computed tomography (CT); (c) heart-axial MRI; (d) bladder-MRI.

Electronics 2019, 8, 88 14 of 18 Electronics 2019, 8, x FOR PEER REVIEW 14 of 18

(a) (b) (c) (d) Figure 10. Comparative subjective quality assessments medical video using bandelet Figure 10. Comparative subjective quality assessments medical video using bandelet (CDF9/7)-SPIHT (CDF9/7)-SPIHT (top row), and wavelet (CDF9/7)-SPIHT (Bottom row) at 0.5 Mbps. (a) Coronary (top row), and wavelet (CDF9/7)-SPIHT (Bottom row) at 0.5 Mbps. (a) Coronary angiography-X-ray; angiography-X-Ray;(b) abdomen/pelvis-computed (b) abdomen tomography /pelvis-computed (CT); (c) heart-axial tomography MRI; (d(CT);) bladder-MRI. (c) heart-axial MRI; (d) bladder-MRI. From these results, illustrated by some examples in Figures8–10, we can draw three conclusions: From these results, illustrated by some examples in Figures 8–10, we can draw three 1 The performance of bandelet-SPIHT algorithm is superior to that of SPIHT (wavelet-based). conclusions: 2 Bandelet-SPIHT is able to capture complex geometric contents of images and surfaces and 1 Thereduce performance drastically of existing bandelet-SPIHT redundancy algorithm in video is while superior classical to that SPIHT of SPIHT (based (wavelet-based). on discrete wavelet 2 Bandelet-SPIHTtransform) is subject is able to artifacts.to capture complex geometric contents of images and surfaces and 3 reduceThe bandelet-SPIHT drastically existing algorithm redundancy gives better in results video inwhile the case classical of CDF SPIHT 9/7 wavelet (based filters on discrete than in waveletthe case transform) of Le Gall 5/3.is subject to artifacts. 3 The bandelet-SPIHT algorithm gives better results in the case of CDF 9/7 wavelet filters than in 4.4. Bandelet-SPIHTthe case of Le Gall Versus 5/3. Standard Encoders Comparison Advanced video coding (AVC), also known as H.264/MPEG-4, and its successor high-efficiency 4.4. Bandelet-SPIHT Versus Standard Encoders Comparison video coding (HEVC), also known as H.265 and MPEG-H Part 2, are compared to the proposed coder bandelet-SPIHTAdvanced video using coding data metrics, (AVC), namely also known PSNR as (dB). H.264/MPEG-4, and its successor high-efficiency videoFigure coding 11 (HEVC),shows the also performance, known as inH.265 terms and of PSNRMPEG-H vs bitrate,Part 2, ofare three compared compression to the techniques proposed coderH264/AVC, bandelet-SPIHT H265/HEVC using and data bandelet-SPIHT. metrics, namely Each PSNR value (dB). represents an average within its confidence interval.Figure For 11 example, shows the for 270performance, kbps, the PSNR in terms obtained of PSNR bybandelet-SPIHT vs bitrate, of three is higher compression than that techniques produced H264/AVC,by the other methodsH265/HEVC and itsand confidence bandelet-SPIHT. interval isEach tightened value around represents the mean an average value. On within the other its confidencehand, for 2260 interval. kbps, For the example, PSNR obtained for 270 kbps, by bandelet-SPIHT the PSNR obtained is lower by bandelet-SPIHT than that produced is higher by HEVC. than thatIn addition, produced its by confidence the other methods interval isand asymmetric its confiden withce interval a greater is tightened tendency around towards the lower mean values.value. OnConsequently, the other hand, we will for have2260 littlekbps, confidence the PSNR obtained in this average by bandelet-SPI value obtainedHT is by lower the algorithmthan that produced proposed byfor HEVC. high data In rates.addition, its confidence interval is asymmetric with a greater tendency towards lower values.As Consequently, illustrated in Figure we will 11 have, the proposedlittle confidence method in demonstratesthis average value a relatively obtained better by the performance algorithm proposedcompared for to thehigh standard data rates. video coding AVC and HEVC in terms of PSNR. For example, for a bit rate greaterAs thanillustrated 1.3 Mbps, in Figure it is clear 11, the that proposed in terms method of PSNR, demonstrates the performance a relatively of the proposed better performance algorithm (bandelet-SPIHT)compared to the standard is lower than video that coding of HEVC AVC but and almost HEVC always in terms higher of thanPSNR. that For of example, AVC. And for for a low bit ratebit rates, greater our than encoder 1.3 outperformsMbps, it is clear video that coding in terms standards. of PSNR, For example,the performance for 0.6 Mbps, of the our proposed method algorithmexceeds AVC (bandelet-SPIHT) and HEVC by 3is dB lower and than 0.5 dB that respectively. of HEVC but almost always higher than that of AVC. And Andfor low for lowbit rates, bit rates, our ourencoder encoder outperforms outperforms vide videoo coding coding standards. standards. For For example, example, for for 0.6 0.6 Mbps, Mbps, our method exceeds AVC andand HEVCHEVC byby 33 dBdB andand 0.50.5 dBdB respectively.respectively. And for low bit rates, our encoder outperforms video coding standards. For example, for 0.6 Mbps, our method exceeds AVC and HEVC by 3 dB and 0.5 dB respectively.

ElectronicsElectronics2019 2019, 8, 88, 8, x FOR PEER REVIEW 15 of15 18of 18

It is well-known that complex images, fast camera movements and action scenes require high bitrate.It is well-known In this context, that HEVC complex is th images,e most fastefficient camera technology movements to date. and action scenes require high bitrate.In this context, HEVC is the most efficient technology to date.

39

38

37

36

35

34 PSNR (dB) PSNR

33

32 H.264 MPEG4/AVC 31 H.265 HEVC CC Bandelet-SPIHT 30

29 0 500 1000 1500 2000 2500 BITRATE (kbps) Figure 11. Performance analysis in terms of PSNR (dB) between standard coding methods and the Figure 11. Performance analysis in terms of PSNR (dB) between standard coding methods and the proposed method. proposed method.

ThoseThose with with fixed fixed scenes, scenes, which which are easier are easier to encode, to enco requirede, arequire slower a rate, slower as is rate, the case as is in the medical case in imaging.medical In imaging. this context, In this bandelet-SPIHT context, bandelet-SPIHT is superior at is low superior bitrates at duelow to bitrates its ability due to to capture its ability the to geometriccapture complexity the geometric of the complexity image structure of the imag ande its structure multi-resolution and its multi-resolution analysis. analysis. ForFor the the tests, tests, we we used used an improvedan improved version version of HEVCof HEVC sometimes sometimes called called HEVC HEVC screen screen content content codingcoding extension. extension. This This standardization standardization effort effort was was developed developed to leverageto leverage the the unique unique characteristics characteristics of of computer-generatedcomputer-generated and and digitally-captured digitally-captured videos. videos. The intraThe intra block block copy modecopy mode takes advantagetakes advantage of the of presencethe presence of exact of copies exact ofcopies blocks of betweenblocks betw differenteen different pictures pictures in a video in a sequence. video sequence. TheThe test test conditions conditions include include video video sequences sequences comprising comprising a variety a variety of resolutions of resolutions and content and typescontent (texttypes and (text graphics and graphics with motion, with camera-capturedmotion, camera-cap content,tured mixedcontent, content mixed andcontent animation). and animation). For each For each sequence, two color formats are tested. One is in RGB and the other is in Y CB CR (Y is the luma sequence, two color formats are tested. One is in RGB and the other is in Y CB CR (Y is the luma component and CB and CR are the blue-difference and red-difference chroma components) and component and CB and CR are the blue-difference and red-difference chroma components) and several of theseveral 4:2:0 of sequences the 4:2:0 aresequences generated are fromgenerated their 4:4:4from correspondences.their 4:4:4 correspondences. ThreeThree test test configurations, configurations, i.e., alli.e., intra all intra (AI), random(AI), random access access (RA) and (RA) low-delay and low-delay B (LB) wereB (LB) used. were Forused. lossy For coding, lossy four coding, QPs four (quantization QPs (quantization parameters), parameters), {22, 27, 32, {22, 37}, 27, were 32, 37}, applied. were applied. TheThe statistical statistical study study we havewe have carried carried out isout in lineis in with line the with recommendations the recommendations of ITU— of TelecommunicationsITU—Telecommunications Standardization Standardization Sector, and Sector, more specificallyand more specifically the Video Codingthe Video Experts Coding Group. Experts Group.Indeed, the benchmarking used to measure performance of HEVC and the proposed codec was made partlyIndeed, by the experts benchmarking in videovia used quality to measure criteria performance adapted to of the HEVC human and visual the proposed system suchcodec as was SSIMmade and partly VIF (objective by experts quality in video metrics) via andquality partly criteria by clinicians adapted or to PhD the studentshuman visual (subjective system quality such as assessment)SSIM and fromVIF (objective our medical quality research metrics) institute and partly (INSERM). by clinicians However, or PhD the subjectivestudents (subjective improvement quality measuredassessment) somewhat from exceedsour medical the improvement research institute measured (INSERM). by the However, PSNR metric. the subjective improvement measured somewhat exceeds the improvement measured by the PSNR metric. 4.5. Computational Complexity 4.5. Computational Complexity We cannot directly compare the temporal performance of the proposed algorithm with that of H264/AVCWe andcannot H265/HEVC, directly compare due to the their temporal structural perfor differencemance andof the their proposed compression algorithm concept with based that of on totallyH264/AVC different and H265/HEVC, philosophies. due However, to their we structural will develop difference and clarify and their this issue,compression which isconcept crucial based to understandingon totally different the purpose philosophies. of our work. However, we will develop and clarify this issue, which is crucial to understanding the purpose of our work.

Electronics 2019, 8, 88 16 of 18

We will focus on two fundamental differences between the MPEG4 family (AVC and HEVC) and bandelet-SPIHT: Unlike MPEG4’s (H264/AVC and H265/HEVC) use of both intra- and inter-frame encoding, bandelet-SPIHT employs only intra-frame encoding. This means that each frame is compressed individually, and no attention is paid by the encoder to either previous or following frames, and so it is not predictive in any way. Consequently, for bandelet-SPIHT the (time) latency to compress a frame is shorter (as the codec does not have to rely on generating forward and reverse differences between frames). The type of transform used by AVC and HEVC is DCT while our compression algorithm uses 2nd-generation wavelets (bandelet transform). For an image of pixels, the computational complexity of the best bandelet basis algorithm   is O N3/2 whereas for the DCT the computational complexity can be approximated by  N  O 2 ((log2 N) − 1) . This shows that the calculation of bandelet transform is more complex than that of the DCT From these two considerations, we can conclude that the computation time required for H264/AVC compression is comparable to that of bandelet-SPIHT, which has been confirmed experimentally. For high spatial resolution videos, the increase of computational complexity for HEVC is about 25% on the average compared to AVC (and thus for bandelet-SPIHT), which seems quite reasonable with considerations of significant coding efficiency improvements. Therefore, we can conclude that the computational complexity between bandelet-SPIHT and H264/AVC is of the same order, whereas that of HEVC is significantly higher. However, H.265/HEVC technology has the capability of doubling the compression efficiency of H.264/AVC. What this means is that H.265 uses only half the bitrate otherwise used by H.264/AVC when transmitting images [9,10]. So, H.265/HEVC significantly reduces the storage and bandwidth requirements during video encoding, which is highly beneficial for both software and hardware usage.

5. Conclusions The objective motivating our study was to propose a coding method with high performances in terms of visual quality and PSNR at low bit rates. The bandelets using a SPIHT encoder is an effective condidate meeting the specific requirements of our application in the medical field and particulary in medical imaging where the quality of the image at low bit rates is a major requirement for the practitioner. In order to show the efficiency of bandelet-SPIHT algorithm, a set of medical sequences as test examples is considered. At low bit rates, the bandelet-SPIHT algorithm gives significantly better results (PSNR metric) comparing to some cutting-edge coding techniques such as H.26x family (namely H.264 and H.265). In addition, the computational complexity of the proposed algorithm is of the same order of magnitude as that of H264/AVC but significantly lower than that of H265/HEVC. From this point of view, the expected goal is achieved and the prospects for application in medical imaging are promising. Geometric image compression methods are a very dynamic search direction. One could mention the construction of transformations that adapt automatically to the geometry without the need to specify it. A consensus is growing in the scientific community that geometry is the key to significantly improve current compression methods. However, HEVC compressed videos may incur severe quality degradation, particularly at low bit-rates. Thus, it is necessary to enhance the visual quality of HEVC videos at the decoder side. It is in this area that improvements are expected and research is ongoing [34]. Another useful way is to improve objective metrics [35] to evaluate the quality of videos automatically. Electronics 2019, 8, 88 17 of 18

Author Contributions: Methodology, validation, writing—review and editing, A.O.; Project administration, A.T.-A.; Software, M.F. and Y.H.; Supervision, M.A. Funding: This research received no external funding. Acknowledgments: We thank BELADGHEM for the advice provided to HABCHI during his PhD thesis in image compression. Conflicts of Interest: The authors declare no conflict of interest.

References

1. Djeddi, M.; Ouahabi, A.; Batatia, H.; Basarab, A.; Kouamé, D. Discrete wavelet transform for multifractal texture classification: Application to ultrasound imaging. In Proceedings of the IEEE International Conference on Image Processing (IEEE ICIP2010), Hong Kong, China, 26–29 September 2010; pp. 637–640. 2. Sayood, K. Introduction to Data Compression, 5th ed.; The Morgan Kaufmann Series in Multimedia Information and Systems; Elsevier: Amsterdam, The Netherlands, 2017; ISBN 9780128094747. 3. Ouahabi, A. Signal and Image Multiresolution Analysis; Wiley-ISTE: Hoboken, NJ, USA, 2012; ISBN 978-1-848-21257-2. 4. Mallat, S. A Wavelet Tour of Signal Processing: A Sparse Way, 3rd ed.; Academic Press: Cambridge, MA, USA, 2009; ISBN 9780123743701. 5. Shapiro, J.M. Embedded image coding using zerotrees of wavelet coefficients. IEEE Trans. Signal Process. 1993, 41, 3445–3462. [CrossRef] 6. Said, A.; Pearlman, W. A new fast and efficient image codec based on set partitioning in hierarchical trees. IEEE Trans. Circuits Syst. Video Technol. 1996, 6, 243–250. [CrossRef] 7. Cermak, G.W.; Wolf, S.; Tweedy, E.P.; Pinson, M.H.; Webster, A.A. Validating objective measures of MPEG video quality. SMPTE J. 2015, 107, 226–235. [CrossRef] 8. Shi, Y.Q.; Huifang, S. Image and Video Compression for Multimedia Engineering: Fundamentals, Algorithms, and Standards, 2nd ed.; CRC Press: Boca Raton, FL, USA, 2000; ISBN 9780849373640. 9. Sullivan, G.J.; Ohm, J.R.; Han, W.; Wiegand, T. Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circuits Syst. Video Technol. 2012, 22, 1649–1668. [CrossRef] 10. Ohm, J.R.; Sullivan, G.J.; Schwarz, H.; Tan, T.K.; Wiegand, T. Comparison of the coding efficiency of video coding standards-including high efficiency video coding (HEVC). IEEE Trans. Circuits Syst. Video Technol. 2012, 22, 1669–1684. [CrossRef] 11. International Telecommunication Union. ITU-T Recommendation H.265, High Efficiency Video Coding; International Telecommunication Union: Geneva, Switzerland, 2013. 12. Ouahabi, A. Wavelet Denoising: Application in Medical Imaging; Wiley-ISTE: Hoboken, NJ, USA, 2019. 13. Zehani, S.; Ouahabi, A.; Oussalah, M.; Mimi, M.; Taleb-Ahmed, A. New and robust method for trabecular bone texture based on fractal dimension. In Proceedings of the 42nd Annual Conference of the IEEE Industrial Electronics Society (IECON 2016), Firenze, Italy, 23–26 October 2016; pp. 992–997. 14. Ouahabi, A. Image Denoising using Wavelets: Application in Medical Imaging. In Advances in Heuristic Signal Processing and Applications; Chatterjee, A., Nobahari, H., Siarry, P., Eds.; Springer: Basel, Switzerland, 2013; pp. 287–313. 15. Ouahabi, A. A review of wavelet denoising in medical imaging. In Proceedings of the International Workshop on Systems, Signal Processing and Their Applications (IEEE/WOSSPA’13), Algiers, Algeria, 12–15 May 2013; pp. 19–26. 16. Sidahmed, S.; Messali, Z.; Ouahabi, A.; Trépout, S.; Messaoudi, C.; Marco, S. Nonparametric denoising methods based on contourlet transform with sharp frequency localization: Application to electron microscopy images with low exposure time. Entropy 2015, 17, 2781–2799. 17. Katsigiannis, S.; Papaioannou, G.; Maroulis, D. A contourlet transform based algorithm for real-time video encoding. In Proceedings of the SPIE, Real-Time Image and Video processing Conference, Brussels, Belgium, 19 April 2012; Volume 8437. 18. Shu, Z.; Luo, Y.; Liu, G.; Xie, Z. Arbitrarily shape object-based video coding technology by wavelet-based contourlet transform. In Proceedings of the IEEE International Conference on Information and Automation, Luoyang, China, 8–10 August 2015. Electronics 2019, 8, 88 18 of 18

19. Starck, J.L.; Murtagh, F.; Candès, E.J.; Donoho, D.L. Gray and color image contrast enhancement by the curvelet transform. IEEE Trans. Image Process. 2003, 12, 706–717. [CrossRef][PubMed] 20. Le Pennec, E.; Mallat, S. Sparse geometric image representations with bandelets. IEEE Trans. Image Process. 2005, 14, 423–438. [CrossRef][PubMed] 21. Yang, H.Y.; Wang, X.Y.; Niu, P.P.; Liu, Y.C. Image denoising using nonsubsampled shearlet transform and twin support vector machines. Neural Netw. 2014, 57, 152–165. [CrossRef][PubMed] 22. Habchi, Y.; Ouahabi, A.; Beladgham, M.; Taleb-Ahmed, A. Towards a new standard in medical video compression. In Proceedings of the 42nd Annual Conference of the IEEE Industrial Electronics Society (IECON2016), Florence, Italy, 23–26 October 2016; pp. 1061–1066. 23. Peyre, G.; Mallat, S. Surface compression with geometric bandelets. ACM Trans. Graphics 2005, 24, 601–608. [CrossRef] 24. Peyre, G.; Mallat, S. Discrete bandelets with geometric orthogonal filters. In Proceedings of the IEEE International Conference on Image Processing (ICIP2005), Genova, Italy, 14 September 2005; Volume 1, pp. 65–86. 25. Peyre, G.; Mallat, S. A Matlab Tour of Second Generation Bandelets. Available online: http://www.cmap. polytechnique.fr/~{}peyre/bandelets/BandeletsTutorial.pdf (accessed on 1 November 2018). 26. Danyali, H.; Mertins, A. Highly scalable image compression based on SPIHT for network applications. In Proceedings of the IEEE International Conference on Image Processing (ICIP2002), Rochester, NY, USA, 22–25 September 2002. 27. Chang, S.; Carin, L. A modified SPIHT algorithm for image coding with a joint MSE and classification distortion measure. IEEE Trans. Image Process. 2006, 15, 713–725. [CrossRef][PubMed] 28. Winkler, S.; Mohandas, P. The evolution of video quality measurement: From psnr to hybrid metrics. IEEE Trans. Broadcast. 2008, 54, 660–668. [CrossRef] 29. Winkler, S. Digital Video Quality Vision Models and Metrics, 1st ed.; John Wiley & Sons Ltd.: West Sussex, UK, 2005; ISBN 978-0470024041. 30. Margaret, H.P.; Wolf, S. A new standardized method for objectively measuring video quality. IEEE Trans. Broadcast. 2004, 50, 312–322. 31. Hands, D. A basic multimedia quality model. IEEE Trans. Multimedia 2004, 6, 806–816. [CrossRef] 32. Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [CrossRef] 33. Sheikh, H.R.; Bovik, A.C. Image information and visual quality. IEEE Trans. Image Process. 2006, 15, 430–444. [CrossRef][PubMed] 34. Yang, R.; Xu, M.; Liu, T.; Wang, Z.; Guan, Z. Enhancing Quality for HEVC Compressed Videos. IEEE Trans. Circuits Syst. Video Technol. 2018, in press. [CrossRef] 35. Dumic, E.; Grgic, S.; Grgic, M. New image-quality measure based on wavelets. J. Electron. Imaging 2010, 19, 011018. [CrossRef]

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).