📄 draft-herlein-speex-rtp-profile-02.txt
字号:
frequency using the 'a=rtpmap' line. Any number from 96 to 127 could have been chosen (the allowed range for dynamic types). The value of the sampling frequency is typically 8000 for narrow band operation, 16000 for wide band operation, and 32000 for ultra-wide band operation. If for some reason the offerer has bandwidth limitations, the client may use the "b=" header, as explained in SDP [4]. The following example illustrates the case where the offerer cannot receive more than 10 kbit/s. m=audio 8088 RTP/AVP 97 b=AS:10 a=rtmap:97 speex/8000 In this case, if the remote part agrees, it should configure its Speex encoder so that it does not use modes that produce more than 10 kbit/s. Note that the "b=" constraint also applies on all payload types that may be proposed in the media line ("m="). An other way to make recommendations to the remote Speex encoder isHerlein, et al. Expires October 3, 2005 [Page 8]Internet-Draft draft-herlein-speex-rtp-profile-02 April 2005 to use its specific parameters via the a=fmtp: directive. The following parameters are defined for use in this way: ptime: duration of each packet in milliseconds. sr: actual sample rate in Hz. ebw: encoding bandwidth - either 'narrow' or 'wide' or 'ultra' (corresponds to nominal 8000, 16000, and 32000 Hz sampling rates). vbr: variable bit rate - either 'on' 'off' or 'vad' (defaults to off). If on, variable bit rate is enabled. If off, disabled. If set to 'vad' then constant bit rate is used but silence will be encoded with special short frames to indicate a lack of voice for that period. cng: comfort noise generation - either 'on' or 'off'. If off then silence frames will be silent; if 'on' then those frames will be filled with comfort noise. mode: Speex encoding mode. Can be {1,2,3,4,5,6,any} defaults to 3 in narrowband, 6 in wide and ultra-wide. penh: use of perceptual enhancement. 1 indicates to the decoder that perceptual enhancement is recommended, 0 indicates that it is not. Defaults to on (1). Examples: m=audio 8008 RTP/AVP 97 a=rtpmap:97 speex/8000 a=fmtp:97 mode=4 This examples illustrate an offerer that wishes to receive a Speex stream at 8000Hz, but only using speex mode 3. The offerer may suggest to the remote decoder to activate its perceptual enhancement filter like this: m=audio 8088 RTP/AVP 97 a=rtmap:97 speex/8000 a=fmtp:97 penh=1 Several Speex specific parameters can be given in a single a=fmtp line provided that they are separated by a semi-colon:Herlein, et al. Expires October 3, 2005 [Page 9]Internet-Draft draft-herlein-speex-rtp-profile-02 April 2005 a=fmtp:97 mode=any;penh=1 The offerer may indicate that it wishes to send variable bit rate frames with comfort noise: m=audio 8088 RTP/AVP 97 a=rtmap:97 speex/8000 a=fmtp:97 vbr=on;cng=on The "ptime" attribute is used to denote the packetization interval (ie, how many milliseconds of audio is encoded in a single RTP packet). Since Speex uses 20 msec frames, ptime values of multiples of 20 denote multiple Speex frames per packet. Values of ptime which are not multiples of 20 MUST be ignored and clients MUST use the default value of 20 instead. In the example below the ptime value is set to 40, indicating that there are 2 frames in each packet. m=audio 8008 RTP/AVP 97 a=rtpmap:97 speex/8000 a=ptime:40 Note that the ptime parameter applies to all payloads listed in the media line and is not used as part of an a=fmtp directive. Values of ptime not multiple of 20 msec are meaningless, so the receiver of such ptime values MUST ignore them. If during the life of an RTP session the ptime value changes, when there are multiple Speex frames for example, the SDP value must also reflect the new value. Care must be taken when setting the value of ptime so that the RTP packet size does not exceed the path MTU.10. ITU H.323/H.245 Use of Speex Application is underway to make Speex a standard ITU codec. However, until that is finalized, Speex MAY be used in H.323 [5] by using a non-standard codec block definition in the H.245 [6] codec capability negotiations.11. NonStandardMessage format For Speex use in H.245 [6] based systems, the fields in the NonStandardMessage should be:Herlein, et al. Expires October 3, 2005 [Page 10]Internet-Draft draft-herlein-speex-rtp-profile-02 April 2005 t35CountryCode = Hex: B5 t35Extension = Hex: 00 manufacturerCode = Hex: 0026 [Length of the Binary Sequence (8 bit number)] [Binary Sequence consisting of an ASCII string, no NULL terminator] The binary sequence is an ascii string merely for ease of use. The string is not null terminated. The format of this string is speex [optional variables] The optional variables are identical to those used for the SDP a=fmtp strings discussed in section 5 above. The string is built to be all on one line, each key-value pair separated by a semi-colon. The optional variables MAY be omitted, which causes the default values to be assumed. They are: ebw=narrow;mode=3;vbr=off;cng=off;ptime=20;sr=8000;penh=no; The fifth octet of the block is the length of the binary sequence. NOTE: this method can result in the advertising of a large number of Speex 'codecs' based on the number of variables possible. For most VoIP applications, use of the default binary sequence of 'speex' is RECOMMENDED to be used in addition to all other options. This maximizes the chances that two H.323 based applications that support Speex can find a mutual codec.12. RTP Payload Types Dynamic payload type codes MUST be negotiated 'out-of-band' for the assignment of a dynamic payload type from the range of 96-127. H.323 applications MUST use the H.245 H2250LogicalChannelParameters encoding to accomplish this.13. Security Considerations RTP packets using the payload format defined in this specification are subject to the security considerations discussed in the RTP specification [2], and any appropriate RTP profile. This implies that confidentiality of the media streams is achieved by encryption. Because the data compression used with this payload format is applied end-to-end, encryption may be performed after compression so there is no conflict between the two operations. A potential denial-of-service threat exists for data encodings using compression techniques that have non-uniform receiver-endHerlein, et al. Expires October 3, 2005 [Page 11]Internet-Draft draft-herlein-speex-rtp-profile-02 April 2005 computational load. The attacker can inject pathological datagrams into the stream which are complex to decode and cause the receiver to be overloaded. However, this encoding does not exhibit any significant non-uniformity. As with any IP-based protocol, in some circumstances a receiver may be overloaded simply by the receipt of too many packets, either desired or undesired. Network-layer authentication may be used to discard packets from undesired sources, but the processing cost of the authentication itself may be too high.14. Acknowledgments The authors would like to thank Equivalence Pty Ltd of Australia for their assistance in attempting to standardize the use of Speex in H.323 applications, and for implementing Speex in their open source OpenH323 stack. The authors would also like to thank Brian C. Wiles <brian@streamcomm.com> of StreamComm for his assistance in developing the proposed standard for Speex use in H.323 applications. The authors would also like to thank the following members of the Speex and AVT communities for their input: Ross Finlayson, Federico Montesino Pouzols, Henning Schulzrinne, Magnus Westerlund.15. References15.1 Normative References [1] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", RFC 2119. [2] Schulzrinne, H., Casner, S., Frederick, R. and V. Jacobson, "RTP: A Transport Protocol for real-time applications", RFC 3550. [3] "Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies", RFC 2045. [4] Jacobson, V. and M. Handley, "SDP: Session Description Protocol", RFC 2327. [5] "Packet-based Multimedia Communications Systems", ITU-T Recommendation H.323. [6] "Control of communications between Visual Telephone Systems and Terminal Equipment", ITU-T Recommendation H.245. [7] Schulzrinne, H. and S. Casner, "RTP Profile for Audio and VideoHerlein, et al. Expires October 3, 2005 [Page 12]Internet-Draft draft-herlein-speex-rtp-profile-02 April 2005 Conferences with Minimal Control.", RFC 3551. [8] Walleij, L., "The application/ogg Media Type", RFC 3534.15.2 Informative References [9] "Speexenc/speexdec, reference command-line encoder/decoder", Speex website http://www.speex.org/. [10] "CELP, U.S. Federal Standard 1016.", National Technical Information Service (NTIS) website http://www.ntis.gov/.Authors' Addresses Greg Herlein 2034 Filbert Street San Francisco, California 94123 United States EMail: gherlein@herlein.com Simon Morlat 35, av de Vizille App 42 Grenoble 38000 France EMail: simon.morlat@linphone.org Jean-Marc Valin Department of Electrical and Computer Engineering University of Sherbrooke 2500 blvd Universite Sherbrooke, Quebec J1K 2R1 Canada EMail: jean-marc.valin@hermes.usherb.ca Roger Hardiman 49 Nettleton Road Cheltenham, Gloucestershire GL51 6NR England EMail: roger@freebsd.orgHerlein, et al. Expires October 3, 2005 [Page 13]Internet-Draft draft-herlein-speex-rtp-profile-02 April 2005 Phil Kerr England EMail: phil@plus24.comHerlein, et al. Expires October 3, 2005 [Page 14]Internet-Draft draft-herlein-speex-rtp-profile-02 April 2005Intellectual Property Statement The IETF takes no position regarding the validity or scope of any Intellectual Property Rights or other rights that might be claimed to pertain to the implementation or use of the technology described in this document or the extent to which any license under such rights might or might not be available; nor does it represent that it has made any independent effort to identify any such rights. Information on the procedures with respect to rights in RFC documents can be found in BCP 78 and BCP 79. Copies of IPR disclosures made to the IETF Secretariat and any assurances of licenses to be made available, or the result of an attempt made to obtain a general license or permission for the use of such proprietary rights by implementers or users of this specification can be obtained from the IETF on-line IPR repository at http://www.ietf.org/ipr. The IETF invites any interested party to bring to its attention any copyrights, patents or patent applications, or other proprietary rights that may cover technology that may be required to implement this standard. Please address the information to the IETF at ietf-ipr@ietf.org.Disclaimer of Validity This document and the information contained herein are provided on an "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.Copyright Statement Copyright (C) The Internet Society (2005). This document is subject to the rights, licenses and restrictions contained in BCP 78, and except as set forth therein, the authors retain all their rights.Acknowledgment Funding for the RFC Editor function is currently provided by the Internet Society.Herlein, et al. Expires October 3, 2005 [Page 15]
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -