rfc2354.txt
来自「RFC 的详细文档!」· 文本 代码 · 共 675 行 · 第 1/2 页
TXT
675 行
Network Working Group C. Perkins
Request for Comments: 2354 O. Hodson
Category: Informational University College London
June 1998
Options for Repair of Streaming Media
Status of this Memo
This memo provides information for the Internet community. This memo
does not specify an Internet standard of any kind. Distribution of
this memo is unlimited.
Copyright Notice
Copyright (C) The Internet Society (1998). All Rights Reserved.
Abstract
This document summarizes a range of possible techniques for the
repair of continuous media streams subject to packet loss. The
techniques discussed include redundant transmission, retransmission,
interleaving and forward error correction. The range of
applicability of these techniques is noted, together with the
protocol requirements and dependencies.
1 Introduction
A number of applications have emerged which use RTP/UDP transport to
deliver continuous media streams. Due to the unreliable nature of
UDP packet delivery, the quality of the received stream will be
adversely affected by packet loss. A number of techniques exist by
which the effects of packet loss may be repaired. These techniques
have a wide range of applicability and require varying degrees of
protocol support. In this document, a number of such techniques are
discussed, and recommendations for their applicability made.
It should be noted that this document is introductory in nature, and
does not attempt to be comprehensive. In particular, we restrict our
discussion to repair techniques which require the involvement of the
sender of a media stream, and do not discuss possibilities for
receiver based repair.
For a more detailed survey, the reader is referred to [5].
Perkins & Hodson Informational [Page 1]
RFC 2354 Options for Repair of Streaming Media June 1998
2 Terminology and Protocol Framework
A unit is defined to be a timed interval of media data, typically
derived from the workings of the media coder. A packet comprises one
or more units, encapsulated for transmission over the network. For
example, many audio coders operate on 20ms units, which are typically
combined to produce 40ms or 80ms packets for transmission. The
framework of RTP [18] is assumed. This implies that packets have a
sequence number and timestamp. The sequence number denotes the order
in which packets are transmitted, and is used to detect losses. The
timestamp is used to determine the playout order of units. Most loss
recovery schemes rely on units being sent out of order, so an
application must use the RTP timestamp to schedule playout.
The use of RTP allows for several different media coders, with a
payload type field being used to distinguish between these at the
receiver. Some loss repair schemes send multiple copies of units, at
different times and possibly with different encodings, to increase
the probability that a receiver has something to decode. A receiver
is assumed to have a `quality' ranking of the differing encodings,
and so is capable of choosing the `best' unit for playout, given
multiple options.
A session is defined as interactive if the end-to-end delay is less
then 250ms, including media coding and decoding, network transit and
host buffering.
3 Network Loss Characteristics
If it is desired to repair a media stream subject to packet loss, it
is useful to have some knowledge of the loss characteristics which
are likely to be encountered. A number of studies have been
conducted on the loss characteristics of the Mbone [2, 8, 21] and
although the results vary somewhat, the broad conclusion is clear:
in a large conference it is inevitable that some receivers will
experience packet loss. Packet traces taken by Handley [8] show a
session in which most receivers experience loss in the range 2-5%,
with a somewhat smaller number seeing significantly higher loss
rates. Other studies have presented broadly similar results.
It has also been shown that the vast majority of losses are of single
packets. Burst losses of two or more packets are around an order of
magnitude less frequent than single packet loss, although they do
occur more often than would be expected from a purely random process.
Longer burst losses (of the order of tens of packets) occur
infrequently. These results are consistent with a network where
small amounts of transient congestion cause the majority of packet
loss. In a few cases, a network link is found to be severely
Perkins & Hodson Informational [Page 2]
RFC 2354 Options for Repair of Streaming Media June 1998
overloaded, and large amount of loss results.
The primary focus of a repair scheme must, therefore, be to correct
single packet loss, since this is by far the most frequent
occurrence. It is desirable that losses of a relatively small number
of consecutive packets may also be repaired, since such losses
represent a small but noticeable fraction of observed losses. The
correction of large bursts of loss is of considerably less
importance.
4 Loss Mitigation Schemes
In the following sections, four loss mitigation schemes are
discussed. These schemes have been discussed in the literature a
number of times, and found to be of use in a number of scenarios.
Each technique is briefly described, and its advantages and
disadvantages noted.
4.1 Retransmission
Retransmission of lost packets is an obvious means by which loss may
be repaired. It is clearly of value in non-interactive applications,
with relaxed delay bounds, but the delay imposed means that it does
not typically perform well for interactive use.
In addition to the possibly high latency, there is a potentially
large bandwidth overhead to the use of retransmission. Not only are
units of data sent multiple times, but additional control traffic
must flow to request the retransmission. It has been shown that, in
a large Mbone session, most packets are lost by at least one receiver
[8]. In this case the overhead of requesting retransmission for most
packets may be such that the use of forward error correction is more
acceptable. This leads to a natural synergy between the two
mechanisms, with a forward error correction scheme being used to
repair all single packet losses, and those receivers experiencing
burst losses, and willing to accept the additional latency, using
retransmission based repair as an additional recovery mechanism.
Similar mechanisms have been used in a number of reliable multicast
schemes, and have received some discussion in the literature [9, 13].
In order to reduce the overhead of retransmission, the retransmitted
units may be piggy-backed onto the ongoing transmission, using a
payload format such as that described in [15]. This also allows for
the retransmission to be recoded in a different format, to further
reduce the bandwidth overhead. As an alternative, FEC information
may be sent in response to retransmission requests [13], allowing a
single retransmission to potentially repair several losses. The
choice of a retransmission request algorithm which is both timely and
Perkins & Hodson Informational [Page 3]
RFC 2354 Options for Repair of Streaming Media June 1998
network friendly is an area of current study. An obvious starting
point is the SRM protocol [7], and experiments have been conducted
using this, and with a low-delay variant, STORM [20]. This work
shows the trade-off between latency and quality for retransmission
based repair schemes, and illustrates that retransmission is an
effective approach to repair for applications which can tolerate the
latency.
There is no standard protocol framework for requesting retransmission
of streaming media. An experimental RTP profile extension for SRM-
style retransmission requests has described in [14].
4.2 Forward Error Correction
Forward error correction (FEC) is the means by which repair data is
added to a media stream, such that packet loss can be repaired by the
receiver of that stream with no further reference to the sender.
There are two classes of repair data which may be added to a stream:
those which are independent of the contents of the stream, and those
which use knowledge of the stream to improve the repair process.
4.2.1 Media-Independent FEC
A number of media-independent FEC schemes have been proposed for use
with streamed media. These techniques add redundant data, which is
transmitted in separate packets, to a media stream. Traditionally,
FEC techniques are described as loss detecting and/or loss
correcting. In the case of streamed media, loss detection is
provided by the sequence numbers in RTP packets.
The redundant FEC data is typically calculated using the mathematics
of finite fields [1]. The simplest of finite field is GF(2) where
addition is just the eXclusive-OR operation. Basic FEC schemes
transmit k data packets with n-k parity packets allowing the
reconstruction of the original data from any k of the n transmitted
packets. Budge et al [4] proposed applying the XOR operation across
different combinations of the media data with the redundant data
transmitted separately as parity packets. These vary the pattern of
packets over which the parity is calculated, and hence have different
bandwidth, latency and loss repair characteristics.
Parity-based FEC based techniques have a significant advantage in
that they are media independent, and provide exact repair for lost
packets. In addition, the processing requirements are relatively
light, especially when compared with some media-specific FEC
(redundancy) schemes which use very low bandwidth, but high
complexity encodings. The disadvantage of parity based FEC is that
the codings have higher latency in comparison with the media-specific
Perkins & Hodson Informational [Page 4]
RFC 2354 Options for Repair of Streaming Media June 1998
schemes discussed in following section.
A number of FEC schemes exist which are based on higher-order finite
fields, for example Reed-Solomon (RS) codes, which are more
sophisticated and computationally demanding. These are usually
structured so that they have good burst loss protection, although
this typically comes at the expense of increased latency. Dependent
on the observed loss patterns, such codes may give improved
performance, compared to parity-based FEC.
An RTP payload format for generic FEC, suitable for both parity based
and Reed-Solomon encoded FEC is defined in [17].
4.2.2 Media-Specific FEC
The basis of media-specific FEC is to employ knowledge of a media
compression scheme to achieve more efficient repair of a stream than
can otherwise be achieved. To repair a stream subject to packet
loss, it is necessary to add redundancy to that stream: some
information is added which is not required in the absence of packet
loss, but which can be used to recover from that loss.
The nature of a media stream affects the means by which the
redundancy is added. If units of media data are packets, or if
multiple units are included in a packet, it is logical to use the
unit as the level of redundancy, and to send duplicate units. By
recoding the redundant copy of a unit, significant bandwidth savings
may be made, at the expense of additional computational complexity
and approximate repair. This approach has been advocated for use
with streaming audio [2, 10] and has been shown to perform well. An
RTP payload format for this form of redundancy has been defined [15].
If media units span multiple packets, for instance video, it is
sensible to include redundancy directly within the output of a codec.
For example the proposed RTP payload for H.263+ [3] includes multiple
copies of key portions of the stream, separated to avoid the problems
of packet loss. The advantages of this second approach is
efficiency: the codec designer knows exactly which portions of the
stream are most important to protect, and low complexity since each
unit is coded once only.
An alternative approach is to apply media-independent FEC techniques
to the most significant bits of a codecs output, rather than applying
it over the entire packet. Several codec descriptions include bit
sensitivities that make this feasible. This approach has low
computational cost and can be tailored to represent an arbitrary
fraction of the transmitted data.
Perkins & Hodson Informational [Page 5]
RFC 2354 Options for Repair of Streaming Media June 1998
The use of media-specific FEC has the advantage of low-latency, with
only a single-packet delay being added. This makes it suitable for
interactive applications, where large end-to-end delays cannot be
tolerated. In a uni-directional non-interactive environment it is
possible to delay sending the redundant data, achieving improved
performance in the presence of burst losses [11], at the expense of
additional latency.
4.3 Interleaving
When the unit size is smaller than the packet size, and end-to-end
delay is unimportant, interleaving [16] is a useful technique for
reducing the effects of loss. Units are resequenced before
transmission, so that originally adjacent units are separated by a
guaranteed distance in the transmitted stream, and returned to their
original order at the receiver. Interleaving disperses the effect of
packet losses. If, for example, units are 5ms in length and packets
20ms (ie: 4 units per packet), then the first packet could contain
units 1, 5, 9, 13; the second packet would contain units 2, 6, 10,
14; and so on. It can be seen that the loss of a single packet from
an interleaved stream results in multiple small gaps in the
reconstructed stream, as opposed to the single large gap which would
occur in a non-interleaved stream. In many cases it is easier to
reconstruct a stream with such loss patterns, although this is
clearly media and codec dependent. Note that the size of the gaps is
dependent on the degree of interleaving used, and can be made
arbitrarily small at the expense of additional latency.
The obvious disadvantage of interleaving is that it increases
latency. This limits the use of this technique for interactive
applications, although it performs well for non-interactive use. The
major advantage of interleaving is that it does not increase the
bandwidth requirements of a stream.
A potential RTP payload format for interleaved data is a simple
extension of the redundant audio payload [15]. That payload requires
that the redundant copy of a unit is sent after the primary. If this
restriction is removed, it is possible to transmit an arbitrary
interleaving of units with this payload format.
5 Recommendations
If the desired scenario is a non-interactive uni-directional
transmission, in the style of a radio or television broadcast,
latency is of considerably less importance than reception quality.
In this case, the use of interleaving, retransmission based repair or
FEC is appropriate. If approximate repair is acceptable,
interleaving is clearly to be preferred, since it does not increase
Perkins & Hodson Informational [Page 6]
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?