http:^^www.cs.utexas.edu^users^kornerup^ariane5rep.html
来自「This data set contains WWW-pages collect」· HTML 代码 · 共 786 行 · 第 1/3 页
HTML
786 行
alignment function result called BH, Horizontal Bias, related to the horizontalvelocity sensed by the platform. This value is calculated as an indicatorfor alignment precision over time.</LI><LI>The value of BH was much higher than expected because the early partof the trajectory of Ariane 5 differs from that of Ariane 4 and resultsin considerably higher horizontal velocity values.</LI></UL><P>The SRI internal events that led to the failure have been reproducedby simulation calculations. Furthermore, both SRIs were recovered duringthe Board's investigation and the failure context was precisely determinedfrom memory readouts. In addition, the Board has examined the softwarecode which was shown to be consistent with the failure scenario. The resultsof these examinations are documented in the Technical Report.</P><P>Therefore, it is established beyond reasonable doubt that the chainof events set out above reflects the technical causes of the failure ofAriane 501.</P><H4>2.2 COMMENTS ON THE FAILURE SCENARIO</H4><P>In the failure scenario, the primary technical causes are the OperandError when converting the horizontal bias variable BH, and the lack ofprotection of this conversion which caused the SRI computer to stop.</P><P>It has been stated to the Board that not all the conversions were protectedbecause a maximum workload target of 80% had been set for the SRI computer.To determine the vulnerability of unprotected code, an analysis was performedon every operation which could give rise to an exception, including anOperand Error. In particular, the conversion of floating point values tointegers was analysed and operations involving seven variables were atrisk of leading to an Operand Error. This led to protection being addedto four of the variables, evidence of which appears in the Ada code. However,three of the variables were left unprotected. No reference to justificationof this decision was found directly in the source code. Given the largeamount of documentation associated with any industrial application, theassumption, although agreed, was essentially obscured, though not deliberately,from any external review.</P><P>The reason for the three remaining variables, including the one denotinghorizontal bias, being unprotected was that further reasoning indicatedthat they were either physically limited or that there was a large marginof safety, a reasoning which in the case of the variable BH turned outto be faulty. It is important to note that the decision to protect certainvariables but not others was taken jointly by project partners at severalcontractual levels.</P><P>There is no evidence that any trajectory data were used to analyse thebehaviour of the unprotected variables, and it is even more important tonote that it was jointly agreed not to include the Ariane 5 trajectorydata in the SRI requirements and specification.</P><P>Although the source of the Operand Error has been identified, this initself did not cause the mission to fail. The specification of the exception-handlingmechanism also contributed to the failure. In the event of any kind ofexception, the system specification stated that: the failure should beindicated on the databus, the failure context should be stored in an EEPROMmemory (which was recovered and read out for Ariane 501), and finally,the SRI processor should be shut down.</P><P>It was the decision to cease the processor operation which finally provedfatal. Restart is not feasible since attitude is too difficult to re-calculateafter a processor shutdown; therefore the Inertial Reference System becomesuseless. The reason behind this drastic action lies in the culture withinthe Ariane programme of only addressing random hardware failures. Fromthis point of view exception - or error - handling mechanisms are designedfor a random hardware failure which can quite rationally be handled bya backup system.</P><P>Although the failure was due to a systematic software design error,mechanisms can be introduced to mitigate this type of problem. For examplethe computers within the SRIs could have continued to provide their bestestimates of the required attitude information. There is reason for concernthat a software exception should be allowed, or even required, to causea processor to halt while handling mission-critical equipment. Indeed,the loss of a proper software function is hazardous because the same softwareruns in both SRI units. In the case of Ariane 501, this resulted in theswitch-off of two still healthy critical units of equipment.</P><P>The original requirement acccounting for the continued operation ofthe alignment software after lift-off was brought forward more than 10years ago for the earlier models of Ariane, in order to cope with the ratherunlikely event of a hold in the count-down e.g. between - 9 seconds, whenflight mode starts in the SRI of Ariane 4, and - 5 seconds when certainevents are initiated in the launcher which take several hours to reset.The period selected for this continued alignment operation, 50 secondsafter the start of flight mode, was based on the time needed for the groundequipment to resume full control of the launcher in the event of a hold.</P><P>This special feature made it possible with the earlier versions of Ariane,to restart the count- down without waiting for normal alignment, whichtakes 45 minutes or more, so that a short launch window could still beused. In fact, this feature was used once, in 1989 on Flight 33.</P><P>The same requirement does not apply to Ariane 5, which has a differentpreparation sequence and it was maintained for commonality reasons, presumablybased on the view that, unless proven necessary, it was not wise to makechanges in software which worked well on Ariane 4.</P><P>Even in those cases where the requirement is found to be still valid,it is questionable for the alignment function to be operating after thelauncher has lifted off. Alignment of mechanical and laser strap-down platformsinvolves complex mathematical filter functions to properly align the x-axisto the gravity axis and to find north direction from Earth rotation sensing.The assumption of preflight alignment is that the launcher is positionedat a known and fixed position. Therefore, the alignment function is totallydisrupted when performed during flight, because the measured movementsof the launcher are interpreted as sensor offsets and other coefficientscharacterising sensor behaviour.</P><P>Returning to the software error, the Board wishes to point out thatsoftware is an expression of a highly detailed design and does not failin the same sense as a mechanical system. Furthermore software is flexibleand expressive and thus encourages highly demanding requirements, whichin turn lead to complex implementations which are difficult to assess.</P><P>An underlying theme in the development of Ariane 5 is the bias towardsthe mitigation of random failure. The supplier of the SRI was only followingthe specification given to it, which stipulated that in the event of anydetected exception the processor was to be stopped. The exception whichoccurred was not due to random failure but a design error. The exceptionwas detected, but inappropriately handled because the view had been takenthat software should be considered correct until it is shown to be at fault.The Board has reason to believe that this view is also accepted in otherareas of Ariane 5 software design. The Board is in favour of the oppositeview, that software should be assumed to be faulty until applying the currentlyaccepted best practice methods can demonstrate that it is correct.</P><P>This means that critical software - in the sense that failure of thesoftware puts the mission at risk - must be identified at a very detailedlevel, that exceptional behaviour must be confined, and that a reasonableback-up policy must take software failures into account.</P><P>2.3 THE TESTING AND QUALIFICATION PROCEDURES</P><P>The Flight Control System qualification for Ariane 5 follows a standardprocedure and is performed at the following levels :</P><UL><P>- Equipment qualification <BR>- Software qualification (On-Board Computer software) <BR>- Stage integration <BR>- System validation tests.</P></UL><P>The logic applied is to check at each level what could not be achievedat the previous level, thus eventually providing complete test coverageof each sub-system and of the integrated system.</P><P>Testing at equipment level was in the case of the SRI conducted rigorouslywith regard to all environmental factors and in fact beyond what was expectedfor Ariane 5. However, no test was performed to verify that the SRI wouldbehave correctly when being subjected to the count-down and flight timesequence and the trajectory of Ariane 5.</P><P>It should be noted that for reasons of physical law, it is not feasibleto test the SRI as a "black box" in the flight environment, unlessone makes a completely realistic flight test, but it is possible to doground testing by injecting simulated accelerometric signals in accordancewith predicted flight parameters, while also using a turntable to simulatelauncher angular movements. Had such a test been performed by the supplieror as part of the acceptance test, the failure mechanism would have beenexposed.</P><P>The main explanation for the absence of this test has already been mentionedabove, i.e. the SRI specification (which is supposed to be a requirementsdocument for the SRI) does not contain the Ariane 5 trajectory data asa functional requirement.</P><P>The Board has also noted that the systems specification of the SRI doesnot indicate operational restrictions that emerge from the chosen implementation.Such a declaration of limitation, which should be mandatory for every mission-criticaldevice, would have served to identify any non-compliance with the trajectoryof Ariane 5.</P><P>The other principal opportunity to detect the failure mechanism beforehandwas during the numerous tests and simulations carried out at the FunctionalSimulation Facility ISF, which is at the site of the Industrial Architect.The scope of the ISF testing is to qualify :</P><UL><P>- the guidance, navigation and control performance in the whole flightenvelope, <BR>- the sensors redundancy operation, - the dedicated functions of the stages,<BR>- the flight software (On-Board Computer) compliance with all equipmentof the Flight Control Electrical System.</P></UL><P>A large number of closed-loop simulations of the complete flight simulatingground segment operation, telemetry flow and launcher dynamics were runin order to verify :</P><UL><P>- the nominal trajectory <BR>- trajectories degraded with respect to internal launcher parameters<BR>- trajectories degraded with respect to atmospheric parameters <BR>- equipment failures and the subsequent failure isolation and recovery</P></UL><P>In these tests many equipment items were physically present and exercisedbut not the two SRIs, which were simulated by specifically developed softwaremodules. Some open-loop tests, to verify compliance of the On-Board Computerand the SRI, were performed with the actual SRI. It is understood thatthese were just electrical integration tests and "low-level "(bus communication) compliance tests.</P><P>It is not mandatory, even if preferable, that all the parts of the subsystemare present in all the tests at a given level. Sometimes this is not physicallypossible or it is not possible to exercise them completely or in a representativeway. In these cases it is logical to replace them with simulators but onlyafter a careful check that the previous test levels have covered the scopecompletely.</P><P>This procedure is especially important for the final system test beforethe system is operationally used (the tests performed on the 501 launcheritself are not addressed here since they are not specific to the FlightControl Electrical System qualification).</P><P>In order to understand the explanations given for the decision not tohave the SRIs in the closed-loop simulation, it is necessary to describethe test configurations that might have been used.</P><P>Because it is not possible to simulate the large linear accelerationsof the launcher in all three axes on a test bench (as discussed above),there are two ways to put the SRI in the loop:</P><UL><P>A) To put it on a three-axis dynamic table (to stimulatethe Ring Laser Gyros) and to substitute the analog output of the accelerometers(which can not be stimulated mechanically) by simulation via a dedicatedtest input connector and an electronic board designed for this purpose.This is similar to the method mentioned in connection with possible testingat equipment level.</P><P>B) To substitute both, the analog output of the accelerometersand the Ring Laser Gyros via a dedicated test input connector with signalsproduced by simulation.</P></UL><P>The first approach is likely to provide an accurate simulation (withinthe limits of the three-axis dynamic table bandwidth) and is quite expensive;the second is cheaper and its performance depends essentially on the accuracyof the simulation. In both cases a large part of the electronics and thecomplete software are tested in the real operating environment.</P><P>When the project test philosophy was defined, the importance of havingthe SRIs in the loop was recognized and a decision was taken to selectmethod B above. At a later stage of the programme (in 1992), this decisionwas changed. It was decided not to have the actual SRIs in the loop forthe following reasons :</P><UL><LI>The SRIs should be considered to be fully qualified at equipment level</LI><LI>The precision of the navigation software in the On-Board Computer dependscritically on the precision of the SRI measurements. In the ISF, this precisioncould not be achieved by the electronics creating the test signals.</LI><LI>The simulation of failure modes is not possible with real equipment,but only with a model.</LI>
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?