Time-Recursive Deinterlacing for Idtv and Pyramid Coding*

Time-Recursive Deinterlacing for Idtv and Pyramid Coding*

Signal Processing: Image Communication 2 (1990) 365-374 365 Elsevier TIME-RECURSIVE DEINTERLACING FOR IDTV AND PYRAMID CODING* Feng-Ming WANG, Dimitris ANASTASSIOU Department of Electrical Engineering and Center for Telecommunications Research, Columbia University, New York, NY 10027-6699, U.S.A. Arun N. NETRAVALI A T& T Bell Laboratories, Murray Hill, NJ 07974, U.S.A. Received 20 March 1990 Abstract. Most improved definition television (IDTV) receivers use progressive scanning to reduce artifacts associated with interlacing (e.g. interline flicker, line crawl, etc). We propose some novel techniques of motion compensated interpolation of the missing lines of interlaced monochrome and color sequences, reducing the artifacts associated with interlacing, and effectively increasing the vertical resolution of the image sequences. Time-recursive motion compensated prediction is introduced, in which all previously displayed history (not just the previous field) is used to predict the missing pixel values. The next future field is also used for the same purpose, by a look-ahead scheme. Motion estimation is done using a quadtree-based segmented block-matching technique with half pixel accuracy. To avoid artifacts and obtain full resolution in still regions, like background, motion adaptation is used as well. We also discuss how to apply this algorithm to pyramid coding to achieve better compression rate and compatibility of progressive with interlaced format standards. Keywords. Motion compensation, deinterlacing, block matching, HDTV, IDTV, pyramid coding. I. Introduction niques [6, 12] are also important in high definition television (HDTV), since many of the HDTV pro- Interlaced scanning is an efficient method of posals either involve transmission of interlaced bandwidth compression that was appropriate video, or high spatial frequency information at a when TV frame memories were expensive. reduced temporal rate. However, in addition to the loss of vertical resol- Figure 1 shows the three-dimensional (3-D) ution, interlacing results in many well-known domain of a sampled image sequence x(m, n, t), artifacts, With the cheap frame memories becom- in which the missing lines are in dotted form. In ing widely available, IDTV receivers display non- Fig. 2, the vertical-temporal grid (for a constant interlaced video by appropriate processing at the value of n) is shown, with circles on the scan lines receiver without changing the scanning structure of an interlaced sequence and crosses on the miss- of the current television system, such as NTSC. ing lines. Proper conversion from interlaced to progressive Interlaced scanning introduces aliasing, unless format reduces line flicker and improves the ver- the moving image is properly pre-filtered with a tical resolution of the displayed images. Such tech- low-pass filter. The 3-D frequency responses of three possible such filters are shown in Fig. 3, as * Work supported in part by the National Science Foundation under grant ECD8811111 A0, by the NSF Presidential Young a direct consequence of the shape of the quincunx Investigator award ECS-84-51499, and by the New York State sampling grid in Fig. 2. Different reconstruction Center for Advanced Technology under project 89-A10. methods are appropriate for each form of prefilter- 0923-5965/90/$03.50 © 1990 - Elsevier Science Publishers B.V. 366 F.-M. Wang et al. / Time-recursive deinterlacing for IDTV and pyramid coding ing. In actual image capturing devices, temporal filtering is due to camera time integration and is performed independent of spatial filtering, result- /1 ing in separable prefiltering. If vertical filtering is not strong enough to fully cover the area between two successive field lines, there is some information i/.t missing between the field lines, which can be esti- 1" mated using appropriate models for the image s m ./ / sequence source. From psychophysical experi- __./ ments, we have found that temporal filtering is // blurring moving objects, due to eye tracking, and 1 that the apparent quality of interlaced video is best if it comes from progressive video by dropping Y/ half the lines with motion adaptive prefiltering. It is also possible to avoid prefiltering altogether, Fig, 1. 3-D sampled sequence. with quite acceptable results. In that case, deinter- lacing only consists of predicting the missing lines because each existing line remains as it is. OXOXO In Section 2, we describe deinterlacing as a resolution enhancement technique using nonlinear xo"xo x interpolation based on a source model. In Section C X 3, we summarize most used deinterlacing tech- 0 X 0 niques. In Section 4, the technique of Time-Recur- sive Motion Compensated Prediction is intro- duced, in which only one frame memory is needed, ×d×ox but the effect of all previous displayed history is m used to predict the missing lines. The deinterlacing Fig. 2. Vertical-temporal grid. algorithm is described in Section 5 in detail. In 2o,::.,~ El~ ..°..°.i ' ....~,4i ~~i ~,~i ~,~,, •........................ I ........."°"'°°"""°°" Fig. 3. Three possible antialiasing pre-interlacing filters. Signal Processing: Image Communication F.-M. Wang et al. / Time-recursive deinterlacing ,for IDTV and pyramid coding 367 Section 6, we discuss how color information is because the 'helper' signal of the second channel used in our algorithm. We show that chrominance can be coded more efficiently (see Section 8). signals slightly but systematically enhance the cor- Some simple temporal models assume that the rectness of motion prediction and improve quality. image scene contains rigid moving objects and that We outline a systolic array based implementation the components of their displacement vectors from of the scheme in Section 7. In Section 8, the appli- frame to frame are not necessarily integer multiples cation of this scheme to pyramid coding is intro- of the spatial sampling period. This implies that duced. Pyramid coding using deinterlacing is use- the same object is sampled in many frames. Con- ful to achieve compatibility of possibly progressive sidering the overlap of all these sampling grids on HDTV with current interlaced TV. The 'helper' a specific object, it is obvious that the 'virtual' signal [7] of pyramid coding can be coded in spatial sampling rate for the object is much denser different ways to achieve lower bit rate. In Section than the one in any individual frame. Assuming, 9 we present the conclusions together with a com- for the moment, that the image sequence is sampled parison of various techniques using numerical dis- without any antialiasing prefiltering, the above fact tortion measurements. implies that one can derive, at least in theory, a high-definition image sequence from the samples of the many frames in which it was captured. If 2. Resolution enhancement based on a source model there is spatial antialiasing prefiltering, then the blurred frame can be reconstructed from the We assume that, preceding interlaced scanning, samples of one frame only, and no additional separable vertical-temporal filtering was done as information is conveyed from the other frames; if in the dotted part (external cube) of Fig. 3, i.e., as there is temporal antialiasing prefiltering, then if the pictures were progressively scanned with a moving_ objects will also be blurred. According to frame rate equal to the interlaced field rate. perceptual psychology, the human eyes tend to Equivalently, such interlaced sequences are pro- track moving objects, so that, if ideal temporal duced by properly dropping half the lines from a prefiltering is applied, blurred moving objects will progressively scanned sequence. Our experiments reduce image quality. This argument shows that it show that the resulting aliasing, which gives rise is desirable to apply weaker antialiasing prefilter- to various interlacing artifacts, is reduced by non- ing schemes, in video capture, than theoretically linear interpolation of the missing lines. This needed, even for progressively scanned scenes, or resolution enhancement is a consequence of the to make the amount of such prefiltering dependent fact that the sampling t~eorem can be 'defeated' on motion, in a pixel-by-pixel adaptive manner. (e.g., some high frequericy information between In the case of traditional interlaced scanning, as the solid and dotted volumes of Fig. 3 can be explained above, there are missing lines and we predicted) if there is extra knowledge of the signal can predict them from previous fields. source. The source model used here assumes that the video scene contains a number of rigid objects moving in a translational manner. The same con- 3. Traditional deinterlacing schemes cepts can be used to achieve purely spatial or temporal resolution enhancement. Nonlinear Deinterlacing is an interpolation problem. There interpolation reduces the energy of the enhance- are various fundamental differences between the ment signal, defined as the difference between the problem of predicting pixel values for interpola- deinterlaced and the actual signals. Therefore, a tion purposes and the same problem for coding two-channel hierarchical pyramid based [3] coding purposes. Unlike coding, where inaccurate predic- scheme will benefit from nonlinear interpolation tions only increase the bit rate slightly, such Vol. 2, No. 3, October 1990 368 F..-M. Wang et al. / Time-recursive deinterlacing for IDTV and pyramid coding inaccuracies can be catastrophic in interpolation, neighboring field lines, resulting in the appearance where the result is displayed as it is. Furthermore, of elongated teeth. in the case of coding, the predicted values must Interframe interpolation can be as simple as be accessible to the decoder, so they must be func- X = C or X = ½/(C + D). This is fine for stationary tions of the reconstructed (hence distorted) values objects, but it obviously results in severe artifacts, of previously encoded pixels. In the case of inter- like multiple objects in a frame, if there is motion. polation, predictions can be functions of the actual Predicting X by ~(A + B + C + D) is not a satisfac- pixel values of both previous and following frames, tory compromise.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    10 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us