Random Versus Systematic Faults: What’s The Difference?

Oct 19, 2017

[:en][vc_row][vc_column][vc_column_text]I saw and responded to a LinkedIn discussion on this very issue, where someone had asked “if I have a misaligned limit switch that fails dangerously, then is it random or systematic?”. This is an intriguing question because many view human error as being systematic and, whereas, this is sometimes true, it’s not always the case.  When teaching our FSE100 course we discuss the differences and why it’s important to categorise failures this way.

We tend to think of Random failures as failures that occur at random time intervals (usually hardware related), which are unpredictable.  In probabilistic analysis where we try to predict the likelihood of a failure on demand, in low demand process applications, we use average failure rates in our PFDavg calculations, based upon constant failure rate during Useful Life.  There are now over 200 Billion unit operating hours of failure rate data that have been collected, which give us a pretty accurate value for certain types of equipment, to use in PFDavg calculations (such as are in exSILentia).

Systematic failures, on the other hand, are insidious and can only be eliminated by a change in design, manufacturing, procedures and training.  What I like to categorise as the 3 Ps:

  • People – are they competent and trained?
  • Procedures – are there well-defined and followed procedures?
  • Paperwork – do we have an audit trail to demonstrate that the first two are being adhered to?

This means that systematic failures are not considered in probabilistic calculations and therefore, if a site is categorizing failures as systematic they could end up with low and unrealistic failure rates, when looking at measuring the SIF performance. For this reason, it’s a good policy to categorise all field failures as random until proven otherwise. In this case, we won’t throw away any failures unnecessarily.

For example, let’s say an instrument technician who was well trained, had performed this task many times, without error, had mis-calibrated a sensor that resulted in it not being able to detect a high level (dangerous condition), although the calibration procedure and paper work was correct.  Would this be categorized as a systematic error or random error?

Many would argue that, because it is human error, it would be a systematic issue.

So, let’s see how this measures up to the 3 Ps:

  • People – the Technician is well trained and so is competent
  • Procedure – the procedure is correct
  • Paperwork – the paperwork is correct

In this case, this would be categorized as a Random error and not Systematic.  Perhaps the technician was distracted, tired, having a bad day, etc.  The Technician just made a mistake.  It’s that simple.

However, it could be argued that for safety-related equipment the procedure should be changed to have a four-eyes policy, which would help prevent the error, so a systematic improvement.

It’s easy to see how confusing it can be in determining whether a fault is random or systematic, which is why we recommend capturing the failure as random until proven otherwise.

So, coming back to the case of the misaligned limit switch, we would need to initially categorise the failure as Random so it’s captured and then to analyse whether it is actually a systematic fault or not, by looking into the 3 Ps.

Why not check out some of the webinars on this subject that are archived on the exida website.[/vc_column_text][/vc_column][/vc_row][:zh][vc_row][vc_column][vc_column_text]No! They are not Inherently Safe!

A collaborative robot is intended to work “collaboratively” with a person. i.e. share a common workspace. It is force and speed limited by design to minimize any potential hazard. Collaborative robots fit the application where the task cannot be easily or cost effectively automated. They are easy to deploy, program and repurpose. Collaborative robots are new to everyone including the standards agencies.

A hazard and risk assessment is required that assesses the robot and the environment that it is deployed in. Just as any other robot, things such as collisions, speed, type of end effector and worksite need to be evaluated. Collaborative robots have their own sorts of collisions and hazards. They may not be as severe, but they still exist.

This all comes down to risk and the amount of risk that you are willing to accept! The diagram below shows the high-level steps for doing a Hazard and Risk Assessment. When following the steps, if you assess the risk and find it to be acceptable (your companies acceptable risk norms) then you are done. No need to add any risk reduction.

The next best approach is to determine if protective measures other than a Safety Function can reduce the risk to an acceptable level. If not, then you must assign a SIL and implement a safety function that will provide the required risk reduction.

exida can effectively train your team to perform machine hazard and risk assessments to identify all possible hazards and estimate the risk for each hazard. Specifically, exida coaches you through the process of evaluating the risk, developing and implementing risk reduction options. exida can also educate your team in multiple approaches to SIL target selection. These are just some of the things exida does to ensure you are on the right path![/vc_column_text][/vc_column][/vc_row] [:]

Submit a Comment

Your email address will not be published. Required fields are marked *

Keep Updated.