Network Working Group                                          D.L. Mills
Request for Comments: 889                                   December 1983

                       Internet Delay Experiments

This memo reports on some measurement experiments and suggests some
possible improvements to the TCP retransmission timeout calculation.
This memo is both a status report on the measurements and advice to
implementers of TCP.

1. Introduction

This memorandum describes two series of experiments designed to explore
the transmission characteristics of the Internet system. One series of
experiments was designed to determine the network delays with respect to
packet length, while the other was designed to assess the effectiveness
of the TCP retransmission-timeout algorithm specified in the standards
documents. Both sets of experiments were conducted during the October -
November 1983 time frame and used many hosts distributed throughout the
Internet system.

The objectives of these experiments were first to accumulate
experimental data on actual network paths that could be used as a
benchmark of Internet system performance, and second to apply these data
to refine individual TCP implementations and improve their performance.

The experiments were done using a specially instrumented measurement
host called a Fuzzball, which consists of an LSI-11 running IP/TCP and
various application-layer protocols including TELNET, FTP and SMTP mail.
Among the various measurement packages is the original PING (Packet
InterNet Groper) program used over the last six years for numerous tests
and measurements of the Internet system and its client nets. This
program contains facilities to send various kinds of probe packets,
including ICMP Echo messages, process the reply and record elapsed times
and other information in a data file, as well as produce real-time
snapshot histograms and traces.

Following an experiment run, the data collected in the file were reduced
by another set of programs and plotted on a Peritek bit-map display with
color monitor.
The plots have been found invaluable in the identification and
understanding of the causes of network glitches and other "zoo"
phenomena. Finally, summary data were extracted and presented in this
memorandum. The raw data files, including bit-map image files of the
various plots, are available to other experimenters upon request.

The Fuzzballs and their local-net architecture, called DCN, have about
two dozen clones scattered worldwide, including one (DCN1) at the
Linkabit Corporation offices in McLean, Virginia, and another at the
Norwegian Telecommunications Administration (NTA) near Oslo, Norway. The
DCN1 Fuzzball is connected to the ARPANET at the Mitre IMP by means of
1822 Error Control Units operating over a 56-Kbps line. The NTA Fuzzball
is connected to the NTARE Gateway by an 1822 interface and then via
VDH/HAP operating over a 9.6-Kbps line to SATNET at the Tanum (Sweden)
SIMP. For most experiments described below, these details of the local
connectivity can be ignored, since only relatively small delays are
involved.

The remote test hosts were selected to represent canonical paths in the
Internet system and were scattered all over the world. They included
some on the ARPANET, MILNET, MINET, SATNET, TELENET and numerous local
nets reachable via these long-haul nets. As an example of the richness
of the Internet system connectivity and the experimental data base, data
are included for three different paths from the ARPANET-based
measurement host to London hosts, two via different satellite links and
one via an undersea cable.

2. Packet Length Versus Delay

This set of experiments was designed to determine whether delays across
the Internet are significantly influenced by packet length. In cases
where the intrinsic propagation delays are high relative to the time to
transmit an individual packet, one would expect that delays would not be
strongly affected by packet length.
This is the case with satellite nets, including SATNET and WIDEBAND, but
also with terrestrial nets where the degree of traffic aggregation is
high, so that the measured traffic is a small proportion of the total
traffic on the path. However, in cases where the intrinsic propagation
delays are low and the measured traffic represents the bulk of the
traffic on the path, quite the opposite would be expected.

The objective of the experiments was to assess the degree to which TCP
performance could be improved by refining the retransmission-timeout
algorithm to include a dependency on packet length. Another objective
was to determine the nature of the delay characteristic versus packet
length on tandem paths spanning networks of widely varying
architectures, including local-nets, terrestrial long-haul nets and
satellite nets.

2.1. Experiment Design

There were two sets of experiments to measure delays as a function of
packet length. One of these was based at DCN1, while the other was based
at NTA. All experiments used ICMP Echo/Reply messages with embedded
timestamps. A cycle consisted of sending an ICMP Echo message of
specified length, waiting for the corresponding ICMP Reply message to
come back and recording the elapsed time (normalized to one-way delay).
An experiment run, resulting in one line of the table below, consisted
of 512 of these volleys.

The length of each ICMP message was determined by a random-number
generator uniformly distributed between zero and 256. Lengths less than
40 were rounded up to 40, which is the minimum datagram size for an ICMP
message containing timestamps and just happens to also be the minimum
TCP segment size. The maximum length was chosen to avoid complications
due to fragmentation and reassembly, since ICMP messages are not
ordinarily fragmented or reassembled by the gateways.

The data collected were first plotted as a scatter diagram on a color
bit-map display.
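The length selection and delay normalization described above can be
sketched as follows. This is a minimal illustration, not the Fuzzball
PING code: the names draw_lengths and one_way_ms, the use of Python's
random module and the seed are all assumptions, and halving the
round-trip time is only one plausible reading of "normalized to one-way
delay".

```python
import random

MIN_LEN = 40     # octets: minimum ICMP datagram carrying timestamps
MAX_LEN = 256    # octets: upper limit chosen to avoid fragmentation
VOLLEYS = 512    # echo/reply cycles per experiment run


def draw_lengths(count=VOLLEYS, seed=None):
    """Draw one length per volley, uniform on 0..256, rounded up to 40."""
    rng = random.Random(seed)
    return [max(MIN_LEN, rng.randint(0, MAX_LEN)) for _ in range(count)]


def one_way_ms(round_trip_ms):
    """Assumed normalization: half of the measured echo/reply time."""
    return round_trip_ms / 2.0


lengths = draw_lengths(seed=1)
```

In this sketch 41 of the 257 possible draws (about one in six) land on
exactly 40 octets, so the shortest length is sampled far more often than
any other single length.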
For all paths involving the ARPANET, the scatter diagram immediately
revealed two distinct characteristics, one for short (single-packet)
messages less than 126 octets in length and the other for long
(multi-packet) messages longer than this. Linear regression lines were
then fitted to each characteristic with the results shown in the
following table. (Only one characteristic was assumed for
ARPANET-exclusive paths.) The table shows for each host the delays, in
milliseconds, for each type of message along with a rate computed on the
basis of these delays. The "Host ID" column designates the host at the
remote end of the path, with a letter suffix used when necessary to
identify a particular run.

Host     Single-packet    Rate  Multi-packet    Rate  Comments
ID           40    125   (bps)    125    256   (bps)
---------------------------------------------------------------------------
DCN1 to nearby local-net hosts (calibration)
DCN5          9                           13  366422  DMA 1822
DCN8         14                           20  268017  Ethernet
IMP17        22                           60   45228  56K 1822/ECU
FORD1        93                          274    9540  9600 DDCMP base
UMD1        102                          473    4663  4800 synch
DCN6        188                          550    4782  4800 DDCMP
FACC        243                          770    3282  9600/4800 DDCMP
FOE         608                         1917    1320  9600/14.4K stat mux

DCN1 to ARPANET hosts and local nets
MILARP       61    105   15358    133    171   27769  MILNET gateway
ISID-L      166    263    6989    403    472   15029  low-traffic period
SCORE       184    318    5088    541    608   15745  low-traffic period
RVAX        231    398    4061    651    740   11781  Purdue local net
AJAX        322    578    2664    944   1081    7681  MIT local net
ISID-H      333    520    3643    715    889    6029  high-traffic period
BERK        336    967    1078   1188   1403    4879  UC Berkeley
WASH        498    776    2441   1256   1348   11379  U Washington

DCN1 to MILNET/MINET hosts and local nets
ISIA-L      460    563    6633   1049   1140   11489  low-traffic period
ISIA-H      564    841    2447   1275   1635    2910  high-traffic period
BRL         560    973    1645   1605   1825    4768  BRL local net
LON         585    835    2724   1775   1998    4696  MINET host (London)
HAWAII      679    980    2257   1817   1931    9238  a long way off
OFFICE3     762   1249    1396   2283   2414    8004  heavily loaded host
KOREA       897   1294    1712   2717   2770   19652  a long, long way off

DCN1 to TELENET hosts via ARPANET
RICE       1456   2358     754   3086   3543    2297  via VAN gateway

DCN1 to SATNET hosts and local nets via ARPANET
UCL        1089   1240    4514   1426   1548    8558  UCL zoo
NTA-L      1132   1417    2382   1524   1838    3339  low-traffic period
NTA-H      1247   1504    2640   1681   1811    8078  high-traffic period

NTA to SATNET hosts
TANUM       107                          368    6625  9600 bps Tanum line
ETAM        964                         1274    5576  Etam channel echo
GOONY       972                         1256    6082  Goonhilly channel echo

2.2. Analysis of Results

The data clearly show a strong correlation between delay and length,
with the longest packets showing delays two to three times the shortest.
On paths via ARPANET clones the delay characteristic shows a stronger
correlation with length for single-packet messages than for multi-packet
messages, which is consistent with a design which favors low delays for
short messages and high throughputs for longer ones.

Most of the runs were made during off-peak hours. In the few cases where
runs were made for a particular host during both on-peak and off-peak
hours, comparison shows a greater dependency on packet length than on
traffic shift.

TCP implementors should be advised that some dependency on packet length
may have to be built into the retransmission-timeout estimation
algorithm to ensure good performance over lossy nets like SATNET. They
should also be advised that some Internet paths may require stupendous
timeout intervals ranging to many seconds for the net alone, not to
mention additional delays on host-system queues.

I call to your attention the fact that the delays (at least for the
larger packets) from ARPANET hosts (e.g. DCN1) to MILNET hosts (e.g.
ISIA) are in the same ballpark as the delays to SATNET hosts (e.g. UCL)!
I have also observed that the packet-loss rates on the MILNET path are
at present not negligible (18 in 512 for ISIA-2).
Presumably, the loss is in the gateways; however, there may well be a
host or two out there swamping the gateways with retransmitted data and
which have a funny idea of the "normal" timeout interval. The recent
discovery of a bug in the TOPS-20 TCP implementation, where spurious
ACKs were generated at an alarming rate, would seem to confirm that
suspicion.

3. Retransmission-Timeout Algorithm

One of the basic features of TCP that allows it to be used on paths
spanning many nets of widely varying delay and packet-loss
characteristics is the retransmission-timeout algorithm, sometimes known
as the "RSRE Algorithm" for the original designers. The algorithm
operates by recording the time and initial sequence number when a
segment is transmitted, then computing the elapsed time for that
sequence number to be acknowledged. There are various degrees of
sophistication in the implementation of the algorithm, ranging from
allowing only one such computation to be in progress at a time to
allowing one for each segment outstanding at a time on the connection.

The retransmission-timeout algorithm is basically an estimation process.
It maintains an estimate of the current roundtrip delay time and updates
it as new delay samples are computed. The algorithm smooths these
samples and then establishes a timeout, which if exceeded causes a
retransmission. The selection of the parameters of this algorithm is
vitally important in order to provide effective data transmission and
avoid abuse of the Internet system by excessive retransmissions. I have
long been suspicious of the parameters suggested in the specification
and used in some implementations, especially for long-delay paths over
lossy nets. The experiment was designed to simulate the operation of the
algorithm using data collected from real paths involving some pretty
leaky Internet plumbing.

3.1. Experiment Design

The experiment data base was constructed of well over a hundred runs
using ICMP Echo/Reply messages bounced off hosts scattered all over the
world. Most runs, including all those summarized here, consisted of 512
echo/reply cycles lasting from several seconds to twenty minutes or so.
Other runs designed to detect network glitches lasted several hours.
Some runs used packets of constant length, while others used different
lengths distributed from 40 to 256 octets. The maximum length was chosen
to avoid complications due to fragmentation and reassembly, since ICMP
messages are not ordinarily fragmented or reassembled by the gateways.

The object of the experiment was to simulate the packet delay
distribution seen by TCP over the paths measured. Only the network delay
is of interest here, not the queueing delays within the hosts
themselves, which can be considerable. Also, only a single packet was
allowed in flight, so that stress on the network itself was minimal.
Some tests were conducted during busy periods of network activity, while
others were conducted during quiet hours.

The 512 data points collected during each run were processed by a
program which plotted on a color bit-map display each data point (x,y),
where x represents the time since initiation of the experiment and y the
measured delay, normalized to the one-way delay. Then, the simulated
retransmission-timeout algorithm was run on these data and its computed
timeout plotted in the same way. The display immediately reveals how the
algorithm behaves in the face of varying traffic loads, network
glitches, lost packets and superfluous retransmissions.

Each experiment run also produced summary statistics, which are
collected in the table below. Each line includes the Host ID, which
identifies the run. The suffix -1 indicates 40-octet packets, -2
indicates 256-octet packets and no suffix indicates uniformly
distributed lengths between 40 and 256.
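A replay of measured delays through a simulated timeout algorithm might
look like the sketch below. It assumes the smoothing and timeout rule
given in the TCP specification (RFC 793), namely
SRTT = ALPHA*SRTT + (1-ALPHA)*RTT and
RTO = min(UBOUND, max(LBOUND, BETA*SRTT)). The function name, the
parameter defaults, the choice to seed the estimate with the first
sample and the omission of lost packets are illustrative assumptions,
not details of the simulator actually used for these experiments.

```python
def replay_timeouts(delays_ms, alpha=0.875, beta=2.0,
                    lbound_ms=1000.0, ubound_ms=60000.0):
    """Replay measured delays through an RFC 793 style timeout estimator.

    Returns (timeouts, superfluous): the timeout computed after each
    sample, and the number of samples whose delay exceeded the timeout
    that was in force when the sample arrived.
    """
    srtt = None
    rto = None
    timeouts = []
    superfluous = 0
    for delay in delays_ms:
        # A delay longer than the current timeout would have triggered a
        # retransmission even though the reply was already on its way.
        if rto is not None and delay > rto:
            superfluous += 1
        # Assumption: the first sample seeds the smoothed estimate.
        if srtt is None:
            srtt = float(delay)
        else:
            srtt = alpha * srtt + (1.0 - alpha) * delay
        rto = min(ubound_ms, max(lbound_ms, beta * srtt))
        timeouts.append(rto)
    return timeouts, superfluous


timeouts, superfluous = replay_timeouts(
    [100, 100, 100, 500, 100], alpha=0.5, beta=2.0, lbound_ms=0.0)
```

In the call above the 500 ms sample arrives while the timeout stands at
200 ms, so it is counted once as a superfluous retransmission; the
timeout then rises to 600 ms and relaxes to 400 ms after the next
sample.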
The Lost Packets columns refer to instances when no ICMP Reply message
was received for thirty seconds after transmission of the ICMP Echo
message, indicating probable loss of one or both messages. The RTX
Packets columns refer to instances when the computed timeout is less
than the measured delay, which would result in a superfluous
retransmission. For each of these two types of packets the column
indicates the number of instances and the Time column indicates the
total accumulated time required for the recovery action.

For reference purposes, the Mean column indicates the computed mean
delay of the echo/reply cycles, excluding those cycles involving packet
loss, while the CoV column indicates the coefficient of variation.
Finally, the Eff