Error Control Techniques for Interactive Low-bit Rate Video Transmission
over the Internet
Injong Rhee
Department of Computer Science
North Carolina State University
Raleigh, NC 27695-7534
E-mail: [email protected]
Abstract The e ect of packet loss is also far-reaching. Packet
loss degrades not only the quality of the frames con-
tained in lost packets, but also the quality of their sub-
A new retransmission-based error control technique is
sequent frames. This is b ecause of motion comp ensation
presented that does not incur any additional latency in
employed by video co decs. Motion-comp ensated video
frame playout times, and hence are suitable for inter-
co decs remove temp oral redundancy in a video stream
active applications. It takes advantage of the motion
by enco ding only pixel value di erence prediction er-
prediction loop employed in most motion compensation-
ror b etween an image to b e enco ded inter-frame and a
basedcodecs. By correcting errors in a referenceframe
previously transmitted image reference frame. Packet
causedbyearlier packet loss, it prevents error propaga-
loss can intro duce an error in a reference frame, which
tion. The technique rearranges the temporal dependency
can b e propagated to its subsequent frames and get am-
of frames so that a displayedframe is referenced for the
pli ed as more packets are lost.
decoding of its succeeding dependent frames much later
than its display time. Thus, the delay in repairing lost
Error propagation can be controlled by more fre-
packets can be e ectively masked out. The developed
quently adding intra frames which are co ded temp o-
technique is combined with layered videocoding to main-
rally indep endently. However, the ratio of the com-
tain consistently good video quality even under heavy
pression eciency of an intra-frame over an inter-frame
packet loss. Through the results of extensive Internet
is as large as 3 to 6 times. Increasing the frequency
experiments, the paper shows that layeredcoding can be
of intra-frames could increase the bandwidth require-
very e ective when combined with the retransmission-
menttoomuch for video transmission over a bandwidth-
based error control technique for low-bit rate transmis-
constraint network. Nonetheless, the severe degradation
sion over best e ort networks where no network-level
of image quality due to error propagation has forced
mechanism exists for protecting high priority data from
several p opular video conferencing to ols, suchas nv[7 ],
packet loss.
vic[13 ] and CU-SeeMe[6 ], to adopt an even more drastic
approach. Using a technique called conditional replen-
ishment, these to ols lter out the blo cks that have not
1 Intro duction
changed much from the previous frame and intra-co de
the remaining blo cks. Since all the co ded blo cks are
temp orally indep endent, packet loss a ects only those
Transmitting interactive video over packet switching
frames contained in lost packets. However, this en-
networks is time-constrained. If a packet containing a
hanced error resilience comes at the cost of low com-
video frame is not delivered b efore its scheduled play-
pression eciency. Additional compression can always
out time, and the frame cannot be displayed and ac-
be obtained if temp oral redundancy is removed from
cordingly, the overall quality of video degrades. The
each co ded blo ck i.e., by co ding only their prediction
diculty of meeting this time constraint lies mainly in
error.
frequent o ccurrences of packet loss which are commonly
caused by network congestion. The recovery of lost
The goal is to develop an error recovery scheme
packets b efore their display times is not always feasi-
that solves the error propagation problem without los-
ble b ecause of the delay asso ciated with detecting and
ing much compression eciency. Retransmission-based
repairing the lost packets.
and forward error correction schemes are the two ma jor
error control schemes found in the literature.
Retransmission-based error recovery REC can pro-
vide go o d error resilience without incurring much band-
width overhead b ecause packets are retransmitted only
when some indications exist that they are lost. However,
retransmission always involves additional transmission
delay, and thus has b een widely known ine ective for in-
teractive real-time video applications such as video con- frames temp orally dep end only on the essential signals
ferencing. Many researchers prop osed use of extended of their reference frames. This dep endency ensures that
control or playout times to allow retransmitted packets the distortion of the enhancement signals of a frame
to arrive in time for display [21 , 18 , 12 , 23 ]. This implies do es not a ect the quality of its succeeding frames.
that the playout time of a frame is delayed by at least Since essential signals are protected by FEC, this dep en-
three one-way trip times after its initial transmission dency will signi cantly reduce error propagation. How-
two for packet transmissions and one for a retransmis- ever, QAL has its own limitations. First, since frames
sion request. Under the currentInternet environment, are motion-comp ensated only to the essential signals of
this delaywould b e intolerable for interactive video ap- their reference frames, the temp oral redundancy present
plications, and can severely reduce the interactivityof in the enhancement signals are not exploited at all, re-
the application. sulting in low compression eciency. Second, under
heavy packet loss, even essential signals can be lost,
Retransmission, however, can be still a very e ec-
causing error propagation.
tive technique to improve error resilience in interactive
real-time video conferencing. In this pap er, we study a
These limitations, however, can b e greatly reduced
REC scheme, called periodic temporal dependency dis-
when QAL is combined with PTDD. In this pap er, we
tance PTDD, that can be used for interactive video
show that a motion-comp ensated layered co ding tech-
applications. In particular, the schemes do not require
nique such as QAL can b e very e ective for transmis-
any arti cial extension of control time and play-out de-
sion over a lossy packet-switched network even when no
lays, and thus are suitable for interactive applications.
network-level mechanism to protect a prioritized stream
In the scheme, frames are simply displayed at their nor-
exists. When used with PTDD, the layering technique
mal playout times without any delay, as they are de-
yields reasonably go o d compression eciency since p e-
co ded. Thus, if a packet arrives after the playout time
rio dic frames temp orally dep ends on b oth the essential
of its frame, the frame will be displayed with errors.
and enhancement signals of their reference frames. This
However, this \late" packet can b e used to remove error
dep endency is safe b ecause retransmission can recover
propagation. Rather than discarding the late packet,
lost packets containing b oth signals. In addition, p eri-
PTDD uses it to restore its frame although the frame
o dic frames reduce error propagation b ecause their im-
is displayed. Because the frame is used as a reference
mediately succeeding non-p erio dic frames use them as
frame for its succeeding frames, restoring the reference
a reference frame.
frame stops error propagation.
We conducted b oth transatlantic and transpaci c In-
To allow enough time for retransmitted packets to
ternet transmission tests from the East Coast of the
arrive b efore their frames are referenced for the recon-
US to show the utility of the REC-based techniques for
struction of their dep endent frames, the PTDD scheme
transmission over a long distance network. The exp er-
extends the temporal dependency distance TDD of
imental results indicated that the prop osed technique
frames. The TDD of a frame is de ned to b e the min-
achieve go o d error resilience and compression eciency.
imum numb er of frame intervals or inter-frame delay
In Section 2, we describ e the related work, and in
between that frame and another frame on which it is
Section 3 we describ e PTDD and QAL schemes. Section
temp orally dep endent. In the PTDD scheme, every p-
4 contains a discussion of the exp erimental results, and
th frame whichwe call periodic frame has an extended
Section 5 contains the conclusion.
TDD while the other frames have TDD 1. Note that
the extension of TDD do es not a ect the playout times
of frames since the playout times of frames are deter-
2 Related Work
mined solely byinter-frame delays and the initial con-
trol time which is used to reduce delay jitter. In the
PTDD scheme, the TDD of p erio dic frames is deter-
We rst discuss the related work on retransmission-
mined by the estimated delaybetween the sender and
based techniques for video transmission over packet-
the receiver.
switched networks and then discuss related work on lay-
ered video co ding.
In PTDD, non-p erio dic frames with TDD 1 are not
protected by retransmission. If packet loss o ccurs for a
Retransmission-based error recovery. Dempsey et al.[5 ]
non-p erio dic frame, the frame will b e displayed with er-
applied retransmission for the recovery of audio packets.
rors which can b e propagated to its subsequent frames
They showed that by adding some delay b efore the play-
until the next p erio dic frame is received. The receiv-
out of each received audio packet, retransmission can
ing video quality will p erio dically uctuate. To remedy
b e used to protect audio data from packet loss. Their
this, we apply a layered co ding technique, called quality
work hinges on the earlier b ehavior study by Brady [3 ]
assurance layering QAL [22 , 15 , 8 ], for non-p erio dic
showing that although less than 200 ms round trip delay
frames which divides video signals into essential and en-
is required for high qualityvoice applications, delays up
hancement signals. Essential signals are protected bya
to 600 ms can b e tolerable byhuman ears.
simple forward error correction FEC technique. Since
Ramamurthy and Raychaudhuri [21 ] applied a sim-
the amount of essential signals is often much smaller
ilar technique to video transmission over ATM. They
than that of the entire signals, with only a little FEC
analyzed the p erformance of video transmission over an
redundant information, we can maintain relatively go o d
ATM network when b oth retransmission and error con-
video quality.
cealment are used to repair errors o ccurring from cell
QAL is di erent from the technique adopted in vic
loss. They analytically showed that for a coast-to-coast
[14 ] in that QAL uses motion comp ensation. In QAL,
techniques. It was shown that by utilizing priority pack- ATM connection, 33 ms to 66 ms play-out delay is su-
etization, the basic image quality can be maintained cient to see a signi cant improvement in image quality.
if the HP stream is guaranteed to be received. How-
Padop oulos and Parulkar [18 ] prop osed an imple-
ever, these techniques do not solve the error propagation
mentation of an ARQ scheme for continuous media
problem b ecause the essential signals of a frame still de-
transmission. Various techniques including selective
p end on b oth the essential and enhancement signals of
rep eat, retransmission expiration, and conditional re-
its reference frame. Since the enhancement signals are
transmission are implemented inside a kernel. Their
less reliably transmitted, frames that dep end on these
exp erimentover an ATM connection showed the e ec-
signals also carry along the same error
tiveness of their scheme.
Intra-H.261 is used in a p opular Internet video con-
Most recently, Li et al. [12 ] and Xu et al. [23 ]
ferencing to ol called vic that enco des each frame as
used retransmission in the recovery of lost packets for
an intra-frame [13 ]. Using a conditional replenishment
video multicasting. Li et al [12 ] prop osed a novel
technique, the co dec is shown to give excellent error re-
scheme for distributing an MPEG-co ded video over a
silience over packet loss. Another b ene t of the co dec is
b est-e ort network. By transmitting di erent frame
the reduced computation in enco ding since it do es not
typ es I, P and B frames of MPEG to di erentmulti-
involve a motion prediction lo op. Alayering scheme for
cast groups, they implemented a simple layering mech-
Intra-H.261 using a bit plane division technique is also
anism in which a receiver can adjust frame play-out
discussed in [2 , 14 ].
times during congestion by joining or leaving a mul-
Quality assurance layering QAL was studied in ticast group. For instance, consider an MPEG picture
[22 , 15 ]. In this scheme, errors due to the loss of LP pattern: IBBPBBPBBPBB. Their scheme delays the
packets do not propagate b ecause each frame is tem-
play-out times of frames for one frame interval. They
p orally dep endent only on the essential signals of its
call this delayed play-out time adaptive playback point.
reference frame which are assumed to be reliably re-
When a receiver leaves the B frame group, the adaptive
ceived. Shimamura et al. [22 ] also studied the e ect of
playback p oint is additionally extended by three frame
various priority packetization techniques including fre-
intervals b ecause the next frame to b e displayed after a
quency truncation.
P frame is at three frame intervals away. The scheme is
shown e ective for non-interactive real-time video ap-
Ghanbari [8 ] prop osed similar layering techniques as
plications.
QAL. In the techniques, each frame is rst decimated to
pro duce a low-resolution image, then the low-resolution In a video conference involving a large number of
image is co ded using H.261 or a DCT-based co dec and participants, di erent participants may have di erent
packetized as HP data. The original image is then com- service requirements. While some participants may re-
pared with the deco ded frame of its low-resolution frame quire real-time interactions with other participants, oth-
to pro duce a di erence image which is co ded using a ers may simply want to watch or record the confer-
di erent co dec and packetized as a LP data. This co d- ence. Xu et al. [23 ] contended that retransmission can
ing scheme will have similar error resilience as the QAL be e ectively used for the transmission of high qual-
technique since the LP data dep end only on the HP ity video to the receivers that do not need a real-time
data. MPEG-2 adopts an approach similar to this. transfer of video data. They designed a new proto col
called structure-orientedresilient multicast STORM in
Khansari et al. [10 ] applied QAL to video trans-
which senders and receivers collab orate to recover lost
mission over a mobile network to solve the \fading"
packets using a dynamic hierarchical tree structure.
problem commonly exp erienced during a mobile host
hand-o p erio d. By keeping the size of the HP stream
large ab out 83 of the total bandwidth, video quality
Layered co ding. Leicher [11 ] applied a forward error
even under fading can b e kept relatively high. They also
correction scheme known as priority encoding transmis-
studied the e ects of di erent packetizing techniques.
sion PET [1 ] to a hierarchically enco ded MPEG video.
He used a temp oral layering scheme in which reference
Prioritylayering techniques are also applied to still
frames e.g., I-frames and P frames are given higher
JPEG image transmission. Posnak et al. [20 ] used a
priority than other temp orally dep endent frames e.g.,
frequency truncation layering technique that partitions
B frames. Since B frames temp orally dep end on P and
DCT blo cks of JPEG enco ded frames into essential and
I frames which are more reliably received, this technique
enhancementlayers. Han and Polyzos [9] also studied
can e ectively suppress error propagation. However,
the e ectiveness of layered co ding through the hierarchi-
in a low bit rate conferencing, the frequency of I and
cal mo de of JPEG and presented a statistical analysis
P frames must be kept very low b ecause of their low
showing that the overhead of the co ding metho d can b e
compression eciency. Thus, the resulting images can
insigni cant.
be very jerky as packets are b eing dropp ed to a ect B
frames.
3 Error Recovery Techniques
Pancha and El Zarki applied a priority packetization
scheme to an MPEG enco ded video stream for transmis-
sion over ATM [17 ]. A frequency truncation technique
3.1 Retransmission-Based Error Control REC
is applied in which a xed numb er of DCT co ecients
Our schemes are based on a careful observation on how of each DCT blo ck are allo cated to the HP data cf.
video frames are enco ded in most motion comp ensation- [16 , 4, 20 ]. DeCleene et al. [4] also studied the p er-
based co decs. We base our discussion mostly on H.261 formance of MPEG under various priority packetization Packet received -1 -1 -1 -1 ZZVLC Q DCT
F FFFFF F 0 1 2 3 4 5 6
periodic frame periodic frame Current/reference Reference/Current S0 S1 Periodic Temporal Dependency Distance time Reference Frame R1 Base Reference Frame R0
Current Prediction
Perio dic Temp oral Dep endency Distance Frame (CP) Figure 3:
+
ceived, sends a retransmission request NACK to the
Inverse Motion
sender. The sender gets the NACK at time t and re-
Compensation 2
(construct display
ket 3. The retransmitted packet arrives at frame) transmits pac
Display
time t which is b efore frame 3 is displayed. Packet3is
3
now used to restore the R-frame of frame 3, so frame 3
can b e deco ded and displayed without an error.
Figure 2: H.261 Deco der mo di ed to handle the recov-
ery of reference frames.
Figure 2 shows a H.261 deco der mo di ed to handle
the recovery of R-frames using retransmitted packets.
The only di erence from the original H.261 deco der is
from which many motion comp ensation-based video
one additional frame bu er added to handle the recov-
standards such as MPEG are designed. In H.261, a
ery. When a packet is received and deco ded into an
video sequence consists of two typ es of video frames:
image blo ck, the deco der determines whether the blo ck
intra-frame I-frame and inter-frame P-frame. I-
b elongs to the current frame b eing deco ded or its R-
frame removes only spatial redundancy presentinthe
frame. If it is for the current frame, then the blo ckis
frame. P-frame is enco ded through motion estimation
stored into frame bu er CP along with its motion vec-
using another P-frame or I-frame as a reference frame
tor. If it is for the R-frame, the blo ck is added with
R-frame. For each image blo ck in a P-frame, mo-
its temp orally dep endent blo ck in frame bu er R and
0
tion estimation nds a closely matching blo ck within its
stored into R . Note that CP contains only the pre-
1
R-frame, and generates the distance between the two
diction error and motion vectors of the current frame
matching blo cks as a motion vector. The pixel value
while R contains the fully motion comp ensated image
1
di erences b etween the original P-frame and a motion-
of the R-frame of the current frame, and R contains
0
predicted image of the P-frame obtained by simply cut-
the R-frame of R . We call R a base reference frame
1 0
and-pasting the matching image blo cks from its R-frame
bu er. At the next display time, the current frame is
are enco ded along with the motion vectors.
constructed using the information in CP and R . After
1
displaying the current frame, R is copied to R and
1 0
Most of the previously prop osed retransmission
the displayed image are copied to R . In this scheme,
1
schemes work as follows. When a packet containing the
as long as the packets b elonging to R arrive b efore the
1
enco ding of a frame is lost at a receiver, the receiver de-
construction of the current frame, the packet can be
tects the loss after receiving a subsequent packet of the
used to help remove errors in the current frame. The
lost packet and sends a retransmission request to the
dead line of a packet can b e informally de ned to b e the
sender. After receiving the request, the sender retrans-
arrival time of the packet at the receiver after whichit
mits the packet. We de ne the display time of a packet
is not useful for deco ding any frame. Note that the de-
to b e the time that the frame whose enco ding is con-
co der in Figure 2 extends the deadline of packets by one
tained in the packet is displayed at the receiver. If the
frame interval without delaying frame play-out times.
retransmitted packet arrives b efore its display time, the
frame can b e fully restored. Otherwise, it is discarded
We can easily generalize the ab ovescheme to extend
and the displayed image contains some error. All the
packet deadlines beyond one frame interval. The pe-
subsequently deco ded frames will carry the same error
rio dic temp oral dep endency distance scheme PTDD
until a new I-frame is received.
generalizes the ab ove scheme. See Figure 3. In the
scheme, every i-th frame has an extended temp oral de-
Our scheme di ers from others in that retransmit-
p endency distance TDD i we call this frame aperi-
ted packets arriving after their display times are not
odic frame while all the other inter-frames have TDD
discarded but instead used to reduce error propagation.
1. The TDD of p erio dic frames in Figure 3 is 4. In fact,
In motion comp ensation-based co decs, the correct im-
the pattern of the temp oral dep endency is very simi-
age reconstruction of a currently displayed image de-
lar to a picture group pattern of MPEG. All the frames
p ends on a successful reconstruction of its R-frames.
with TDD 4 can b e regarded as P-frames while the other
The scheme allows that while a frames is b eing recon-
frames as B-frame except the rst frame. Thus, this
structed, the \late" packets of an R-frame can b e de-
scheme can b e easily incorp orated into MPEG. PTDD
co ded and used for restoring the R-frame. This will stop
do es not incur much computational overhead and do es
error propagation b ecause the next frame reconstructed
not require many additional frame bu ers. Only two
would not carry over an error from the R-frame.
additional bu ers are needed to store the R-frames of
Figure 1 illustrates the error recovery using retrans-
the next p erio dic frame.
mission in a video stream containing two packets per
frame. Packet 3 is lost, and the receiver receives packet One drawback of PTDD is that only p erio dic frames
4 at time t and recognizing that packet 3 is not re- are protected by retransmission. An error in a non- 1 Frame interval
frame display frame1 frame 2 frame 3 at the sender Time
Retransmission of packet 3 packet p1 p2 p3 p4 p5 p6 p7 p8 Time transmission times t2
lost Time Arrival t1 t3 Times send NACK Although frame 2 is displayed, the packet can be used to restore the reference frame of frame 3. Thus frame 3 is displayed without error
frame 2 has error Time Playout control time frame 1frame 2 frame 3
Times
Figure 1: The recovery of frames using retransmission
enhancement signals LP stream
frame can propagate until the next p erio dic p erio dic Video In
Q ZZ VLC
ed. In the next section, we discuss how frame is receiv Comparator
+
e can overcome this drawback. In fact, there is an
w - DCT essential signal HP stream
teresting tradeo between the packet deadline and in ZZ VLC Q
Inter/intra
extent of error propagation. A long TDD of pe- the -1 -1
rio d frames can prolong error propagation among non-
However, it allows more time for p e- p erio dic frames. -1 -1
DCT DCT
vered. We plan to explore this rio dic frames to b e reco Inter/intra
+ tradeo in the future. +
Prediction
3.2 Layered Co ding with REC Frame Frame buffer 1 buffer 2
Motion Estimation
The error resilience of non-p erio dic frames can b e im-
proved by employing a layered co ding scheme that pack-
etizes enco ded frames into essential or high priority
and enhancement or low priority signals. Although
Figure 4: QAL Enco der
layered co ding is originally develop ed for a network
paradigm, such as ATM and RSVP, where a certain
t of bandwidth can be reserved, it can also be
amoun High-priority data
ver a b est-e ort network such as the Internet if
used o 132 N
y forward error cor-
the HP stream can b e protected b 1 DC
rection. The HP stream should b e kept small to reduce
2
the bandwidth required to protect the stream b ecause
the size of redundant information intro duced by FEC is
prop ortional to the size of the HP stream. End of data
Quality assurance layering techniques guarantee a
certain level of receiving video qualityeven though all
the enhancement signals are lost. Aversion of quality
assurance layering is shown in Figure 4. It is mo di ed
from H.261 and augmented with priority packetization
N
and conditional replenishment. After DCT co ecients
are quantized Q, they are partitioned into HP and LP
layers as describ ed in Figure 5. For a xed nonzero in-
Figure 5: Partitioning of DCT Co ecients from [16 ]
teger b less than 64, the rst b co ecients are allo cated
to the HP layer, and the remaining co ecients are allo-
cated to the LP layer. The subsequentinter-frame and
estimation and conditional replenishment to enco de the
the currently enco ded frame are used to p erform motion
signals of reference frames, causing error propagation.
Thus, QAL alone cannot b e very e ectiveover a b est
network where no guarantee on the packet loss
HP HP HP HP e ort
b ehavior can b e made. However, QAL combined with
PTDD e ectively suppresses error propagation. The er-
LP LP LP LP rors o ccurring in non-p erio dic frames due to loss in the
HP stream do not propagate b eyond the next p erio dic
frame if the lost packets for the current p erio dic frame
periodic non-periodic non-periodic periodic
vered by retransmission b efore the reconstruc-
frame frame frame frame are reco
tion of the next p erio dic frame.
Figure 6: A Temp oral Dep endency Chain in Layered
PTDD PTDD +QAL
4 Exp erimental Result
The main ob jective of the exp erimentistoshow that
next frame. Only the motion vectors and the HP co-
PTDD is an e ective error control scheme for a real-time
ecients of the current frame are used to reconstruct
interactive video transmission over the Internet. We
a predicted frame. The di erence between the subse-
show this through video transmission exp eriments over
quent frame and the predicted frame is enco ded. In the
transatlantic and transpaci c Internet connections from
scheme, each frame temp orally dep ends only on the es-
Emory University,Atlanta, USA. By mo difying an im-
sential signals of its reference frame. Since a frame is
plementation of H.261, we implemented three variants of
reconstructed from the essential signals of its reference
PTDD HP.261, HPF.261, HPL.261, each of which dif-
frame, an error in the enhancement signals of the ref-
fers from each other in the way that non-p erio dic frames
erence frame caused by packet loss do es not carry over.
are protected. HP.261 do es not provide any protection
Thus, even if all LP stream packets are discarded, a
for non-p erio dic frames; HPF.261 protects non-p erio dic
certain level of video quality can b e maintained.
frames by adding one Exclusive-OR parity packet size
QAL, however, has not yet b een proved to give good
512 bytes to each frame; and HPL.261 combines H.261
compression eciency. The diculty lies in the tradeo
and QAL and it protects the HP stream by adding
between compression eciency and the size of redun-
one Exclusive-OR parity packet size 512 bytes to each
dant information added by FEC. If the HP stream do es
frame.
not contain enough signals, there are not many tem-
The p erformance of the three PTDD co decs is com-
p orally redundant signals b etween the reference frame
pared to that of several other co decs. H.261 is used
reconstructed from the essential signals and the current
as a base case for the comparison. H.261 is combined
frame to be enco ded. On the other hand, as we add
with a FEC technique where one Exclusive-OR parity
more co ecients to the HP stream, the HP stream gets
packet size 512 bytes is added to each frame. We call
larger, and so do es the redundant information added by
this scheme HF.261. In addition, we implement QAL
FEC to protect the HP stream. Due to low compression
without PTDD based on H.261 as describ ed in Fig-
eciency, QAL has b een traditionally used in a situa-
ure 4. This implementation is called HL.261. We also
tion where a large p ortion of bandwidth can b e allo cated
implemented INTRA-H.261 which is adopted in vic.
to the HP stream ab out 50 to 80 of the total band-
INTRA-H.261 is known for good error resilience over
width [10 ]. Under the current Internet environment,
packet loss. For each frame, INTRA-H.261 intra-co des
this may not b e reasonable.
every image blo ckchanged signi cantly from the corre-
PTDD provides a good compromise for these con-
sp onding blo ck in the previous frame. The remaining
icting requirements. Since b oth the essential and en-
blo cks are not co ded. We also implemented INTRAL-
hancement signals of p erio dic frames can be restored
H.261 which combines layered co ding with INTRA-
by retransmission, we can allow p erio dic frames to ex-
H.261 where the DCT co ecients of each co ded blo ckis
ploit the temp oral redundancy present in b oth of the
divided into the HP and LP streams. Again, one parity
essential and enhancement signals of their reference
packet is added to each frame to protect the HP stream.
frames. This increases compression eciency. Even
All the layered enco ders co de 5 DCT co ecients of each
with a small amount of bandwidth allo cated to the HP
DCT blo ck as essential signals and the remaining as en-
stream, QAL combined with PTDD can achieve good
hancement signals.
compression eciency. Note that non-p erio dic frames
use only the essential signals of their reference frames in-
cluding the ones immediately following p erio dic frames.
4.1 Testing Metho dology
So, when a p erio dic frame is displayed with some er-
ror due to packet loss in the LP stream, its dep endent
A test video sequence is obtained from a typical video
non-p erio dic frames would not b e a ected. Figure 6 il-
conferencing session where one typical \talking head"
lustrates achain of temp oral dep endency in a layered
engages in a conversation with the other party. The
PTDD scheme.
video is sampled at 5 frames/sec rate and each frame
is captured in the color CIF YUV format 352 288.
Under heavy or bursty packet loss, even the HP
This video sampling rate allows us to achieve a bit rate
stream loses packets since FEC cannot provide adequate
suitable for transatlantic and transpaci c transmission
protection for the HP stream. Unfortunately, packet
loss in the HP stream intro duces errors in the essential
35
uch load on the network. Con- without imp osing to o m "H.261"
"HP.261"
the long distance and limited bandwidth be- sidering "HPF.261"
34.5 "HPL.261"
ween the testing sites, this frame rate is not unusual. t "HL.261"
"INTRAL-H.261"
around 250 Kbits/sec. In addi-
The target bit rate is "INTRA-H.261"
to the controlled sample rate, we use conditional tion 34
replenishment for all the tested schemes to obtain a de-
bit rate. Adjusting quantization steps would be
sired 33.5
a more common way to control the bit rate. However,
since INTRA-H261 uses conditional replenishment, we
Average PSNR (dB)-> 33
use the same technique uniformly for all the schemes for
fairness. Finding the optimal video quality for a given
32.5
bit rate is outside the scop e of this pap er.
Ab out 40 second length video sequence total 190 32
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 The video sequence is replayed sev-
frames is obtained. Average bytes/frame->
eral times for a ve minute p erio d for each exp eriment.
The replay do es not a ect the integrityof the exp eri-
Figure 7: Compression Eciency of Various Schemes
ment b ecause the rst frame is always completely intra-
co ded without any conditional replenishment. The
Compression Avg. bit FEC HP Avg.
95th frame is intra-co ded with conditional replenish-
scheme rate Kb/s PSNR
ment to remove any artifact due to the deco der drift
H.261 240.6 0 0 34.50
e ect. All the co decs applies a default quantization step
HF.261 245.0 8.4 0 34.49
size 8, and all the motion comp ensated co decs use the
HP.261 232.6 0 0 34.51
full exhaustive searchover search window size 15 by 15.
HPF.261 234.2 8.6 0 34.41
All PTDD co decs set the TDD of all the p erio dic frames
HL.261 252 8 33 33.97
to b e 5 frame intervals. Given 5 frames/s frame rate,
HPL.261 239.3 8.4 26 34.20
this TDD extends the deadlines of p erio dic frames up
INTRA-H.261 247.7 0 0 33.65
to 1 sec.
INTRAL-H.261 252.7 8.1 27 33.49
To see the compression eciency of various co decs
for the input test sequence, we measure the average
Table 1: Chosen data rates for network exp eriments,
p eak signal-to-noise ratio PSNR of deco ded frames
and their average PSNR's
over various data rates. The data rate is measured
by the average number of bytes required to compress
a frame Figure 7. It is clearly shown that for a given
rate sp eci ed in Table 1. The video quality of HF.261 is
data rate, INTRA-H.261 and INTRAL-H.261 show the
similar to HP.261, and is not shown here. Note that the
worst video quality while H.261 shows the b est. To
mean PSNR of HP.261 for the given data rates is 0.1 dB
achieve the same quality, INTRA-H.261 requires more
higher than that of H.261. This is due to the artifact
bandwidth than the other motion comp ensation-based
of the enco der. In general, PTDD gives slightly lower
co decs. For instance, to obtain ab out 34 dB PSNR,
compression eciency than H.261 as shown in Figure 7.
INTRA-H.261 requires 80 11KB/6KB more bits p er
The test video sequence is rst compressed us-
frame than H.261. It is known that more than 1 dB dif-
ing each compression scheme and then packetized into
ference in video quality is clearly visible byhuman eyes.
approximately 512-byte packets. The packetized se-
Although these measurements alone do not prove the
quences of all the PTDD schemes are transmitted over
low compression eciency of INTRA-H.261, it clearly
the Internet. For each transmission test, we obtain a 5
indicates one asp ect of that.
minute trace that records the sequence numb ers and ar-
HL.261 gives only slightly higher compression e-
rival times of all the received packets. The transmission
ciency than INTRA-H.261. This is b ecause the HP
tests are conducted during the p erio ds b etween Oct. 20
stream of HL.261 do es not contain many co ecients,
and only the essential signals of reference frames are
35 On the other used for motion comp ensation in HL.261. "H.261" 34.8 "HP.261"
"HPL.261"
HPL.261 gives excellent compression eciency. hand, "INTRA-H.261"
34.6 In HPL.261, although non-p erio dic frames are enco ded
34.4 in the same as in HL.261, p erio dic frames are motion-
34.2
comp ensated to the full signals of its reference frame,
34
resulting in higher compression eciency. PSNR->
33.8
Table 1 shows the chosen data rate of each co dec for
33.6 transmission along with the ratio of the bit rate of the
33.4
HP stream over the total bit rate, and that of the re-
33.2
dundant bit rate induced by FEC. HP.261 and HPL.261
33
en a little lower data rates b ecause the asso ciated
are giv 0 20 40 60 80 100 120 140 160 180 200
Frame #->
retransmission of lost packets may increase the actual
data rate. Figure 8 shows the PSNR of each frame com-
Figure 8: Video quality of enco ded sequences
pressed by four di erentschemes under the target bit packet frame packet # received received packet frame packet # packet frame packet # received received number number in a frame Y/N time (ms) number number in a frame number number in a frameY/N time (ms) 0 1 0 Y 201 0 1 0 0 1 0 Y 201 1 1 1 Y 450 1 1 1 1 1 1 Y 450 2 1 2 Y 600 2 1 2 2 1 2 Y 600 3 1 3 N * 3 1 3 3 1 3 N * 4 1 4 Y 980 4 1 4 4 1 4 Y 980 5 2 0 Y 1100 5 2 0 5 2 0 Y 1100 6 2 1 Y 1305 6 3 1 6 3 1 Y 1305 7 3 0 N * 7 2 2 7 2 2 N * 8 3 1 Y 1790 8 2 3 8 2 3 Y 1790 9 3 2 N * 9 3 0 9 3 0 N * 10 3 3 Y 2100 10 3 1 10 3 1 Y 2100 11 4 0 N * 11 3 2 11 3 2 N * 12 4 1 N * 12 3 3 12 3 3 N * 13 5 0 Y 2710 13 4 0 13 4 0 Y 2710 14 5 1 Y 3000 14 4 1 14 4 1 Y 3000
(a) (b) (c)
Figure 9: c is the result of mapping an actual trace a to a packet sequence b.
a packetized sequence S for those schemes as if the se- to Oct. 27, 1997, and b etween Jan 30 to Feb 6, 1998.
quence would have b een transmitted in a real test. Each
Each packet of a frame is transmitted at a regular
packet p in trace T is mapp ed to a packet q that has
interval determined by the 5 frames/s frame rate and
the same sequence number as p. If packet p is received,
the numb er of packets within the frame. For example,
we record q as received and assign the receiving time of
for the frame interval of 200 ms, if one frame consists of
p to q . Otherwise, we record q as lost. Figure 9 illus-
10 packets, a packet in the frame is transmitted at 20 ms
trates this mapping. Figure 9a shows a sample trace
interval. Each transmitted packet is assigned a unique
ofaHP.261 sequence `' indicates the packet is not re-
sequence numb er. Retransmitted packets are given the
ceived. Those packets received indicated byYshow
same sequence numb ers as their original packets.
received times. Figure 9 b shows a sample of a pack-
etized H.261 sequence. Figure 9 c shows the result The ARQscheme employed during the exp eriment
of mapping a to b. This mapping technique pro- works as follows. The receiver sends one acknowledg-
vides a very accurate comparison of various transmis- ment to the sender for each received frame. An ac-
sion schemes b ecause the sequences of all the schemes knowledgment contains information ab out the missing
are mapp ed to the same traces. packets of the last p erio dic frame that the receiver re-
ceived. After retransmitting a packet, the sender do es
This mapping technique cannot capture the dynam-
not retransmit the same packet for ab out three frame
ics of acknowledgments and retransmissions in HP.261,
intervals. It may retransmit the packet if it receives an-
HPL.261 and HPF.261. This is b ecause in the exp er-
other acknowledgment after the p erio d indicating that
iment, the acknowledgment is transmitted by the re-
the packet is lost. The receiver also do es not request
ceiver only when a new frame is received and each se-
for the retransmission of packets whose deadlines are
quence may have a di erent number of packets in a
expired. These mechanisms reduce the numb er of un-
frame. Thus, acknowledgments and retransmitted pack-
necessary retransmissions.
ets might b e received at di erent times for di erent se-
quences. The dynamics can only b e captured through a Each trace is fed to an o -line deco der to measure
real transmission, whichiswhy the sequences of HP.261, the signal-to-noise of the received frames. To simplify
HPF.261, and HPL.261 are actually transmitted. The the exp eriment, we did not add any jitter control time
other schemes do not have this problem b ecause they for frame play-out. Each frame is considered to b e dis-
do not involveany retransmission. played at the arrival of the rst packet of its next frame
if that packet is received. If that packet is not received,
We obtained 168 traces of HP.261, 147 traces of
the frame is considered to b e displayed at 200 ms after
HPF.261, and 55 traces for HPL.261 Since many traces
its previous frame's play-out time. If no packet for the
are obtained, it is dicult to present the result of each
frame is received, any frame displayed last will b e dis-
trace indep endently. So we classify the traces into sev-
played for that frame. Retransmitted packets are not
eral loss rate groups and present only the average b e-
used for the displayof their frames, but used only to
havior of the traces in each group. Table 2 shows the
restore their corresp onding reference frames.
loss groups and their corresp onding loss ranges. Since
high loss cases are relatively infrequent, we set a larger For fair comparison, we run trace-driven simulations
range for high loss rates. by mapping each of the traces T to the packetized se-
quence of H.261, HL.261, INTRAL-H.261 and INTRA-
H.261 as follows. We rst obtain a 5 minute length of
Loss group 0.025 0.05 0.075 0.1 0.125 0.15
Loss range 0, 0.025 [0.025,0.05 [0.05,0.075 [0.075,0.1 [0.1, 0.125 [0.125, 0.15
Loss group 0.175 0.2 0.25 0.3 0.35 0.4
Loss range [0.15, 0.175 [0.175,0.2 [0.2, 0.25 [0.25,0.30 [0.3, 0.35 [0.35,0.40
Table 2: Loss Rate Groups and their loss ranges
35
ket loss, the mean PSNR of HP.261 drops "HP.261" and 20 pac
34 "INTRA-H.261"
w that of INTRA-H.261. This is due to the drop in "H.261" b elo
33
the REC recovery rates.
32 e can see that the PSNR of HPF.261
31 In Figure 11, w
is slightly improved from that of HP.261. HPF.261 gives
30
slightly b etter PSNR than INTRA-H.261 for all loss
29
groups except two. This improvement is mainly due 28
PSNR ->
to FEC. However, the improvement is still marginal.
27
Although many packets can b e recovered through FEC
ket loss, as packet loss gets severer, FEC
26 under small pac
e. From Table 3, under less than 10
25 b ecomes ine ectiv
ket loss, FEC could recover from 30 to 50 packet
24 pac
However, as more packets are dropp ed, the FEC
23 lost.
very rate drops b elow 10. 22 reco 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4
loss rate (x 100%) ->
There is a clear correlation b etween the round trip
time and REC recovery rates. As the round trip time
Figure 10: Mean PSNR of H.261, INTRA-H.261 and
delays increase, the recovery rates of p erio dic frames
HP.261
also drop. When round trip times increase beyond
250ms, the recovery rates by REC are signi cantly re-
35
This is b ecause the increased network delay re- "HPF.261" duced.
34 "INTRA-H.261"
the probability for retransmitted packets to be "H.261" duces
33
received b efore their deadlines. Second, in HP.261,
32
there is no protection for non-p erio dic frames against
31
packet loss, and in HPF.261, the protection by FEC
as not so helpful in recovering lost packets for non-
30 w
Thus, as more packets are lost, more 29 p erio dic frames.
28 non-p erio dic frames su er from error propagation. PSNR ->
27
Overall, although the HP.261 and HPF.261 show
ver INTRA-h.261, the improve-
26 b etter error resilience o
t can be considered marginal. This is b ecause
25 men
hemes, little protection is provided for non-
24 in b oth sc
frames. Packet loss in a non-p erio dic frame
23 p erio dic
error propagation which is continued until the 22 starts
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35
p erio dic frame is received. The resulting video
loss rate -> next
quality uctuates substantially. This problem disap-
p ears when PTDD is combined with a b etter FEC
Figure 11: Mean PSNR's of H.261, INTRA-H.261, and
scheme such as QAL.
HPF.261
4.3 Performance of PTDD + QAL + FEC HPL.261
4.2 Performance of PTDD HP.261, and PTDD +
FEC HPF261
In this section, we rep ort the result obtained from the
HPL.261 traces. HPL.261 combines PTDD and QAL.
In this section, we rep ort the results obtained from the
The HP stream generated from QAL is protected by
HP.261 and HPF.261 traces. The sequences of H.261,
FEC. We ran trace-driven simulations of H.261, HL.261,
HF.261 and INTRA-H.261 are mapp ed to the traces
INTRA-H.261, and INTRAL-H.261 based on the traces
of HP.261 and HPF.261. Figures 10 and 11 show the
of HPL.261. Table 4 summarizes the result. Figure 12
average PSNR's of H.261, INTRA-H.261 and HP.261
shows the average PSNR's of various schemes. Wedo
for various loss groups. Table 3 summarizes the result
not have traces for loss groups higher than 20.
of H.261, HF.261, HPF.261, and INTRA-H.261.
In Figure 12, HPL.261 shows go o d error resilience,
In Figures 10 and 11, the mean PSNR of H.261 drops
showing clear separation ab out 1 dB from INTRA-
drastically even under small packet loss, showing the
H.261. 1 dB or higher PSNR di erence in video quality
e ect of error propagation. INTRA-H.261, HP.261, and
is clearly visible byhuman eyes. The go o d p erformance
HPF.261 exhibit generally go o d error resilience. They
of HPL.261 can b e b est explained from Figure 13. As
all show similar PSNRs for all loss groups. Between 12
Loss of H.261 HPF.261 INTRA-H.261 HF.261
Rate traces PSNR D. R. PSNR D. R. RTT PSNR D. R. PSNR D.R.
dB Kb/s dB Kb/s REC FEC ms dB Kb/s db Kb/s
2.5 24 32.91 247.0 34.16 247.9 89.7 48.88 188.5 33.12 259.0 33.62 246.4
5.0 42 30.39 246.5 33.48 249.1 96.21 42.23 197.7 32.46 258.5 31.66 246.0
7.5 32 28.72 245.3 32.71 249.6 96.64 33.77 223.5 31.83 257.3 30.07 244.8
10.0 14 28.17 245.7 32.13 250.8 84.56 24.46 239.1 31.41 257. 6 29.24 245.1
12.5 14 26.73 249.7 31.21 261.9 81.70 18.77 325.1 30.80 259. 7 27.55 249.5
15.0 9 26.58 249.5 30.60 263.3 57.94 11.78 388.1 30.4 259.0 27.26 249.9
17.5 6 25.45 250.2 29.19 263.7 50.74 9.34 437.7 29.94 259.3 25.91 250.6
25.0 5 26.27 250 29.31 261.7 41.9 5.11 359.3 29.53 260.1 2 6.69 249.8
35.0 1 25.68 239.8 29.35 245.4 24.96 6.14 243.5 28.84 251.5 26.9 239.3
Table 3: Exp erimental Data based on HPF.261 traces
Loss of INTRA-H.261 INTRAL-H.261 H.261 HPL.261 HL.261
Rate traces D.R. D. R. FEC D.R. D.R. REC FEC RTT D.R. FEC
Kb/s Kb/s Kb/s Kb/s ms Kb/s
2.5 22 249 243 60.7 243 238 89.9 61.3 242.6 243 63.5
5 16 247 240 59.8 240.8 239 78.0 57.8 280.4 238 57.9
7.5 7 249 242 52.4 242 240 73.2 52.1 310.5 240 54.4
10 3 249 241 35.5 240 240 57 39.1 305.6 239 41.7
12.5 2 249 243 48.5 242 245 84.4 48.7 347 244 54.5
Table 4: Exp erimental Data based on HPL.261 traces
35 35 "HPL.261" "HPL.261" 34 "INTRA-H.261" "HPF.261" "INTRAL-H.261" "HP.261" 33 "HL.261" 34 "INTRA-H.261" "H.261" 32 33 31
30 32 29
28 PSNR -> PSNR -> 31 27
26 30 25
24 29 23
22 28 0 0.05 0.1 0.15 0.2 0.25 0 0.05 0.1 0.15 0.2
loss rate -> loss rate ->
Figure 12: Mean PSNR's of H.261, HL.261, HPL.261, Figure 14: PSNR vs. Loss Rate Comparison of HP.261,
INTRA-H.261, and INTRAL-H.261 HPF.261, and HPL.261
in HP.261, the PSNR of HPL.261 drops when there is
not contain enough information to improve the receiv-
heavy loss for a frame. However, it quickly b ounces
ing video qualityeven though they are successfully re-
back when the loss rate of the subsequent frames quickly
ceived. Second, the compression eciency of INTRAL-
reduces. This is unlikeHP.261 where reb ound happ ens
H.261 is lower than that of INTRA-H.261. Thus, under
mostly around p erio dic frames. We clearly see many
no or little loss, its average PSNR is always a bit lower
plateaus indicating that packet loss do es not havemuch
than that of INTRA-H.261. The PSNR di erence b e-
impact on the video quality of non-p erio dic frames.
tween INTRAL-H.261 and INTRA-H.261 over various
loss groups seems to remain constant b ecause of these
HL.261 p erforms quite well under low packet loss.
two reasons.
As packet loss gets substantial, The PSNR of HL.261
quickly drops. This is b ecause b eyond low loss, the FEC
We also compare the mean PSNRs of all the PTDD
scheme protecting the HP stream b ecomes ine ective
schemes presented in this pap er in Figure 14. We use the
and the HP stream starts losing packets causing error
mean PSNR of INTRA-H.261 obtained from the traces
propagation.
of HPL.261 as a reference for comparison. From the g-
ure, we can see that HPL.261 seems to p erform b est. At
Surprisingly, INTRAL-H.261 p erforms slightly worse
the b eginning, when there is no or little loss, HPL.261
than INTRA-H.261. This can b e explained in twoways.
gives the worst PSNR among REC-based co decs. How-
First, the HP stream contains have only 5 DCT co ef-
ever, as packet loss gets larger, HP.261 and HPF.261
cients from each co ded blo ck. These co ecients do 40 "HPL.261" "H.261"
35
30 PSNR ->
25
20 6000 7000 8000 9000 10000 11000 12000 13000 14000 15000 sequence # -> 100 "lossrate"
80
60
loss rate -> 40
20
0 6000 7000 8000 9000 10000 11000 12000 13000 14000 15000
packet sequence # ->
Figure 13: One trace of HPL.261 with 10 packet loss { PSNR, and Loss and Recovery Rates
p ointed out in [23 ], in a multicast group, while some re- quickly drops while HPL.261 sustains go o d video qual-
ceivers want real-time video, others maywanttowatch ity.
or record the transmitted video. These observers may
want the highest quality that the video source could pro-
5 Discussion
vide although they can tolerate a longer play-out delay.
At the same time, they mayhave only a small amount
of bandwidth allo cated for the video.
Retransmission has widely b een known ine ective for
recovering lost packets in real-time interactive video ap-
Motion-comp ensated enco ding allows much b etter
plications. This pap er challenges the conventional wis-
video quality for a given data rate than intra-co ding.
dom and presents new retransmission-based error con-
Our scheme allows real-time receivers to view video in
trol techniques that do not incur any additional latency
a comparable qualityasintra-co ding schemes. At the
in the frame play-out time, and hence are suitable for
same time, it allows the same enco ded video to b e multi-
interactive applications.
casted to other non-real-time receivers. These receivers
can view the video in very high qualityby adding an
One main implication of our work is that many
additional delay i.e., control time b efore the display
motion comp ensation prediction-based co decs, suchas
of the rst frame. On the other hand, for intra-co ded
MPEG, and H.261, are still useful for Internet inter-
video to b e sentonalow bandwidth network, its qual-
active video transmission over a lossy network. Some
ity needs to b e reduced substantially. In this case, al-
of the disadvantages of the motion comp ensated co decs
though real-time receivers may get the video in similar
cited in the literature [14 , 13 ] include 1 computational
quality as the motion-comp ensation scheme with REC,
complexity, 2 error resilience, 3 tight coupling be-
non-real-time receivers will receive a p o or quality video.
tween the prediction state at the enco der and that at the
Note that this feature is di erent from media scaling [14 ]
deco der, and 4 compute-scalable deco ding. In this pa-
where receivers with a higher network capacity always
p er, we showed that the H.261 equipp ed with our REC
get a higher quality image. Here, this feature allows
schemes achieves comparable error resilience to that of
even the receivers with a low network capacity to get
INTRA-H.261. We also b elieve that some of the other
a high quality while trading interact-ability. The mo-
disadvantages can b e overcome with a simple mo di ca-
tion comp ensated co decs equipp ed with a REC scheme
tion to the co decs. For instance, the compute-scalable
can provide this feature as they generally give b etter
deco ding can also b e achieved by deco ding only p erio dic
compression eciency and lost packets can b e recovered
frames and shedding o the computational load for de-
through retransmission.
co ding non-p erio dic frames in PTDD.
Having said some of the disadvantages of motion-
References
comp ensated co decs, we would like to emphasize one
of their advantages over INTRA-H.261, which is high
[1] A. Albanese, J. Blomer, J. Edmonds, M. Luby, compression eciency. Although INTRA-H.261 gives
and M. Sudan, \Priority Enco ding Transmission," go o d error resilience, the low compression eciency of
IEEE Transactions on Information Theory,vol. 42, INTRA-H.261 will makeitvery dicult to obtain very
no. 6, Nov. 1996. high quality video for a low bit rate transmission. As
[16] N. Ohta, \Packet Video: Mo deling and Signal Pro- [2] Amir, E., McCanne, S., and Vetterli, M. \A Lay-
cessing," Artech House Publishers, 1994. ered DCT Co der for Internet Video," the Proceed-
ings of IEEE International Conference on Image
[17] P.Pancha and M. El Zarki,\MPEG Co ded Video
Processing. September, 1996. Lausanne, Switzer-
and BISDN," IEEE Communications Magazine,
land.
Vol. 32, No. 5, May 1994.
[3] P. Brady, \E ects of Transmission Delay on Con-
[18] C. Papadop oulos and G. Parulkar,
versational Behavior on Echo-Free Telephone Cir-
\Retransmission-based error control for continuous
cuits," Bel l System Technical Journal,vol. 50, no.
media applications," Proceedings of the Sixth Inter-
1, pp. 115 { 134, Jan. 1993.
national Workshop on Network and Operating Sys-
[4] B. DeCleene, P. Pancha, M. El Zarki, and H.
tem Support for Digital Audio and Video, pp. 5{12,
Sorensen, \Comparison of PriorityPartition Meth-
1996.
ods for VBR MPEG," Proceedings of IEEE Info-
[19] Priority Enco ding Transmission, Web Page:
com'94, Toronto, Canada, June 1994.
http://www.icsi.b erkeley.edu/PET/icsi-p et.html.
[5] B. Demsey, J. Lieb eherr, and A. Weaver, \On
retransmission-based error control for continuous
[20] E. J. Posnak, S. P. Gallindo, A. P. Stephens and H.
media trac in packet-switching networks," Com-
M. Vin. \Techniques for ResilientTransmission of
puter Networks and ISDN Systems, 1996.
JPEG Video Streams" the Proceedings of Multime-
dia Computing and Networking, San Jose, Volume
[6] T. Dorcey, \CU-SeeMe desktop video conferencing
2417, Pages 243-252, February 1995.
software," ConneXions, vol. 9, no. 3, Mar. 1995.
[21] G. Ramamurthy, and D. Raychaudhuri, \Perfor-
[7] R. Fredrick, Network Videonv, XeroxPalo Alto
mance of packet video with combined error re-
Research Center.
covery and concealment," Proceedings of INFO-
COM'95, pp. 753{761, April 1995.
[8] M. Ghanbari, \Twolayer co ding of video signals for
VBR networks," IEEE Journal on Selected Areas
[22] K. Shimamura, Y. Hayashi, and F. Kishino,
in Communications,vol. 7, no. 5, June 1989.
\Variable-bit-rate Co ding Capable of Comp ensat-
ing for Packet Loss," Proceedings of Visual Com- [9] K. Han and G. C. Polyzos, \Multi-Resolution Lay-
munications and Image Processing, Nov. 1988, pp. ered Co ding for Real-Time Image Transmission:
991 { 998. Architectural and Error Control Considerations,"
Real-Time Imaging," Academic Press to app ear.
[23] R. Xu, C. Myers, H. Zhang, R. Yavatkar, \Re-
[10] M. Khansari, A. Jalali, E. Dub ois, and P. Mermel-
silientmulticast supp ort for continuous-media ap-
stein, \Low bit-rate video transmission over fading
plications," Proceedings of the Sixth International
channels for wireless micro cellular systems," IEEE
Workshop on Network and Operating System Sup-
Transactions on Circuits and Systems for Video
port for Digital Audio and Video, St Louis, May
Technology,vol. 6, pp. 1-11, February 1996.
1997.
[11] C. Leicher, \Hierarchical enco ding of MPEG
sequences using priority enco ding transmission
PET," ICSI technical rep ort TR-94-058, Nov.
1994.
[12] X. Li, S. Paul, P.Pancha, and M. Ammar, \Lay-
ered video multicast with retransmission LVMR:
evaluation of error recovery schemes," Proceedings
of the Sixth International Workshop on Network
and Operating System Support for Digital Audio
and Video, St Louis, May 1997.
[13] S. McCanne and V. Jacobson, \vic: a exible
framework for packet video," in Proceedings of
ACM Multimedia'95, San Francisco, CA, Nov.
1995, pp. 511{522
[14] S. McCanne, M. Vetterli, and V. Jacobson, \Low-
complexity video co ding for receiver-driven lay-
ered multicast," IEEE Journal on Selected Areas
in Communications, vol. 16, no. 6, pp. 983-1001,
August 1997
[15] M. Normura, T. Fujii, N. Ohta, \Layered Packet-
Loss Protection for Variable Rate Co ding Using
DCT," Proceedings of International Workshop on
Packet Video, D7, Sept. 1988.