📄 rfc2658.txt
字号:
RFC 2658 RTP Payload Format for PureVoice(tm) Audio August 1999
Additionally, senders have the following restrictions:
o Once beginning transmission with a given SSRC and given interleave
value, MUST NOT increase the interleave value. If the interleave
value needs to be increased, a new SSRC number MUST be used.
o MAY decrease the interleave value only between interleave groups.
If the interleave value is decreased, it MUST NOT be increased
(even to the original value), although it may be decreased again
at a later time.
3.5 Finding Interleave Group Boundaries
Given an RTP packet with sequence number S, interleave value (field
LLL) L, and interleave index value (field NNN) N, the interleave
group consists of RTP packets with sequence numbers from S-N to S-N+L
inclusive. In other words, the Interleave group always consists of
L+1 RTP packets with sequential sequence numbers. The bundling value
for all RTP packets in an interleave group MUST be the same.
The receiver determines the expected bundling value for all RTP
packets in an interleave group by the number of CODEC data frames
bundled in the first RTP packet of the interleave group received.
Note that this may not be the first RTP packet of the interleave
group sent if packets are delivered out of order by the underlying
transport.
On receipt of an RTP packet in an interleave group with other than
the expected bundling value, the receiver MAY discard CODEC data
frames off the end of the RTP packet or add erasure CODEC data frames
to the end of the packet in order to manufacture a substitute packet
with the expected bundling value. The receiver MAY instead choose to
discard the whole interleave group and play silence.
3.6 Reconstructing Interleaved Audio
Given an RTP sequence number ordered set of RTP packets in an
interleave group numbered 0..L, where L is the interleave value and B
is the bundling value, and CODEC data frames within each RTP packet
that are numbered in order from first to last with the numbers 1..B,
the original, time-ordered sequence of output frames from the CODEC
may be reconstructed as follows:
First L+1 frames:
Frame 0 from packet 0 of interleave group
Frame 0 from packet 1 of interleave group
And so on up to...
Frame 0 from packet L of interleave group
K. McKay Standards Track [Page 6]
RFC 2658 RTP Payload Format for PureVoice(tm) Audio August 1999
Second L+1 frames:
Frame 1 from packet 0 of interleave group
Frame 1 from packet 1 of interleave group
And so on up to...
Frame 1 from packet L of interleave group
And so on up to...
Bth L+1 frames:
Frame B from packet 0 of interleave group
Frame B from packet 1 of interleave group
And so on up to...
Frame B from packet L of interleave group
3.6.1 Additional Receiver Responsibility
Assume that the receiver has begun playing frames from an interleave
group. The time has come to play frame x from packet n of the
interleave group. Further assume that packet n of the interleave
group has not been received. As described in section 4, an erasure
frame will be sent to the Qcelp CODEC.
Now, assume that packet n of the interleave group arrives before
frame x+1 of that packet is needed. Receivers SHOULD use frame x+1
of the newly received packet n rather than substituting an erasure
frame. In other words, just because packet n wasn't available the
first time it was needed to reconstruct the interleaved audio, the
receiver SHOULD NOT assume it's not available when it's subsequently
needed for interleaved audio reconstruction.
4 Handling lost RTP packets
The Qcelp CODEC supports the notion of erasure frames. These are
frames that for whatever reason are not available. When
reconstructing interleaved audio or playing back non-interleaved
audio, erasure frames MUST be fed to the Qcelp CODEC for all of the
missing packets.
Receivers MUST use the timestamp clock to determine how many CODEC
data frames are missing. Each CODEC data frame advances the
timestamp clock EXACTLY 160 counts.
Since the bundling value may vary (it can only decrease), the
timestamp clock is the only reliable way to calculate exactly how
many CODEC data frames are missing when a packet is dropped.
K. McKay Standards Track [Page 7]
RFC 2658 RTP Payload Format for PureVoice(tm) Audio August 1999
Specifically when reconstructing interleaved audio, a missing RTP
packet in the interleave group should be treated as containing B
erasure CODEC data frames where B is the bundling value for that
interleave group.
5 Discussion
The Qcelp CODEC interpolates the missing audio content when given an
erasure frame. However, the best quality is perceived by the
listener when erasure frames are not consecutive. This makes
interleaving desirable as it increases audio quality when dropped
packets are more likely.
On the other hand, interleaving can greatly increase the end-to-end
delay. Where an interactive session is desired, an interleave (field
LLL) value of 0 or 1 and a bundling factor of 4 or less is
recommended.
When end-to-end delay is not a concern, a bundling value of at least
4 and an interleave (field LLL) value of 4 or 5 is recommended
subject to MTU limitations.
The restrictions on senders set forth in sections 3.3 and 3.4
guarantee that after receipt of the first payload packet from the
sender, the receiver can allocate a well-known amount of buffer space
that will be sufficient for all future reception from the same SSRC
value. Less buffer space may be required at some point in the future
if the sender decreases the bundling value or interleave, but never
more buffer space. This prevents the possibility of the receiver
needing to allocate more buffer space (with the possible result that
none is available) should the bundling value or interleave value be
increased by the sender. Also, were the interleave or bundling value
to increase, the receiver could be forced to pause playback while it
receives the additional packets necessary for playback at an
increased bundling value or increased interleave.
6 Security Considerations
RTP packets using the payload format defined in this specification
are subject to the security considerations discussed in the RTP
specification [2], and any appropriate profile (for example [4]).
This implies that confidentiality of the media streams is achieved by
encryption. Because the data compression used with this payload
format is applied end-to-end, encryption may be performed after
compression so there is no conflict between the two operations.
K. McKay Standards Track [Page 8]
RFC 2658 RTP Payload Format for PureVoice(tm) Audio August 1999
A potential denial-of-service threat exists for data encodings using
compression techniques that have non-uniform receiver-end
computational load. The attacker can inject pathological datagrams
into the stream which are complex to decode and cause the receiver to
be overloaded. However, this encoding does not exhibit any
significant non-uniformity.
As with any IP-based protocol, in some circumstances, a receiver may
be overloaded simply by the receipt of too many packets, either
desired or undesired. Network-layer authentication may be used to
discard packets from undesired sources, but the processing cost of
the authentication itself may be too high. In a multicast
environment, pruning of specific sources may be implemented in future
versions of IGMP [5] and in multicast routing protocols to allow a
receiver to select which sources are allowed to reach it.
7 References
[1] TIA/EIA/IS-733. TR45: High Rate Speech Service Option for
Wideband Spread Spectrum Communications Systems. Available from
Global Engineering +1 800 854 7179 or +1 303 792 2181. May also
be ordered online at http://www.eia.org/eng/.
[2] Schulzrinne, H., Casner, S., Frederick, R. and V. Jacobson,
"RTP: A Transport Protocol for Real-Time Applications", RFC
1889, January 1996.
[3] Bradner, S., "Key words for use in RFCs to Indicate Requirement
Levels", BCP 14, RFC 2119, March 1997.
[4] Schulzrinne, H., "RTP Profile for Audio and Video Conferences
with Minimal Control", RFC 1890, January 1996.
[5] Deering, S., "Host Extensions for IP Multicasting", STD 5, RFC
1112, August 1989.
8 Author's Address
Kyle J. McKay
QUALCOMM Incorporated
5775 Morehouse Drive
San Diego, CA 92121-1714
USA
Phone: +1 858 587 1121
EMail: kylem@qualcomm.com
K. McKay Standards Track [Page 9]
RFC 2658 RTP Payload Format for PureVoice(tm) Audio August 1999
9 Full Copyright Statement
Copyright (C) The Internet Society (1999). All Rights Reserved.
This document and translations of it may be copied and furnished to
others, and derivative works that comment on or otherwise explain it
or assist in its implementation may be prepared, copied, published
and distributed, in whole or in part, without restriction of any
kind, provided that the above copyright notice and this paragraph are
included on all such copies and derivative works. However, this
document itself may not be modified in any way, such as by removing
the copyright notice or references to the Internet Society or other
Internet organizations, except as needed for the purpose of
developing Internet standards in which case the procedures for
copyrights defined in the Internet Standards process must be
followed, or as required to translate it into languages other than
English.
The limited permissions granted above are perpetual and will not be
revoked by the Internet Society or its successors or assigns.
This document and the information contained herein is provided on an
"AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING
TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING
BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION
HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Acknowledgement
Funding for the RFC Editor function is currently provided by the
Internet Society.
K. McKay Standards Track [Page 10]
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -