What is fault tree analysis?

what you didn't know about fault tree analysis

This quick guide provides an overview of the basic concepts in fault tree analysis technique, as it applies to data quality. For some more well-known and useful root cause analysis techniques, please check out the:

Definition

The fault tree analysis is a top-down, deductive failure analysis that analyzes the undesirable state of a scheme using Boolean logic to combine a sequence of lower-level occurrences.

Synonym(s):

Fault Tree Diagram, Negative Analytical Tree -though technically, the Fault Tree Analysis outputs the diagram/ tree

Description:

The technique is used mainly in aerospace, engineering and high-hazard industries, but also in software engineering for debugging purposes and determining data quality issues and their causes. The main output is a Fault Tree Diagram (FTD). It is a top-down approach to show the pathways within a system that can lead to a foreseeable, undesirable failure – in our case, a data quality issue. The pathways connect contributory events and conditions, using standard logic symbols (AND, OR, etc.). At the very basic level, the constructs in a fault tree diagram are:

gates/ conditions/ logic gates (all synonyms), and
events.

data quality fault tree diagram

fault tree diagram gates This is oversimplified, but you get the idea. By the way, there are a few other gate types and diagram elements that can be used. If you’re interested in learning more, you can check out the Fault Tree Analysis: A Bibliography from the NASA Scientific and Technical Information (STI) Program.

Fun fact

The basic concept was developed at Bell Telephone Laboratories in 1962 by H.A. Watson, under contract for the US Air Force for use with the Minuteman system. Fun fact within a fun fact: Minuteman system refers to the Minuteman I Intercontinental Ballistic Missile (ICBM) Launch Control System. All the fail safe needed to be in place for this. The technique was later adopted and extensively used by Boeing and the rest is history.

When to use the fault tree analysis

When needed to understand the logic leading to the data quality issue
To show compliance with the data quality requirements
Prioritize the resolution of the causes leading to the top event, i.e. the data quality issue
If you need to create a diagnostic processes for a data quality resolution

Pros

A highly structured and graphical representation of causes and events leading to the data quality issue
Can effectively be used for analysis of recurrent and persistent data quality issues, because such issues tent to have common causes
Good visualization for presenting issues to stakeholders

Cons

If a wrong cause is identified, subsequent causes in the tree might be erroneous or invalid and time is wasted exploring that branch of the tree
If there are too many branches and levels, it might be hard to keep track.

Avoid the pitfalls of bad data quality. Here are the 4 myths about data quality everyone thinks are true.

Steps to develop it

1. State the data quality issue: This is the issue for which you will determine the causes. Ideally you will have a different fault tree diagram for each system/ process that you want to examine

2. Determine top level faults: Brainstorm the main categories with the subject matter experts in the system/ process.

3. Identify causes for top level faults: Brainstorm the main reasons for bad data quality and point them to the above top levels

4. Identify next levels: For each cause, see if the tree goes deeper and keep on adding levels.

Tips

To see what to focus on the most, try to add probabilities of occurrence to each event
Besides the tree, make sure you also take plenty of notes in order to capture the details and the context of each finding
To go in deeper and have a more granular approach, you can apply the “5 whys? technique”

Tools

There are several tools you can use to draw a fault tree diagram, besides the classic whiteboard or flipchart and marker:

MS Visio – FREE template/ example provided above
PowerPoint
Smartdraw

Share0

Tweet0

About the author

George Firican

George Firican is the Director of Data Governance and Business Intelligence at the University of British Columbia, which is ranked among the top 20 public universities in the world. His passion for data led him towards award-winning program implementations in the data governance, data quality, and business intelligence fields. Due to his desire for continuous improvement and knowledge sharing, he founded LightsOnData, a website which offers free templates, definitions, best practices, articles and other useful resources to help with data governance and data management questions and challenges. He also has over twelve years of project management and business/technical analysis experience in the higher education, fundraising, software and web development, and e-commerce industries.

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
sp_landing	1 day	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
sp_t	1 year	The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
tve_leads_unique	1 month	This cookie is set by the provider Thrive Themes. This cookie is used to know which optin form the visitor has filled out when subscribing a newsletter.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_1Z635JPV9L	2 years	This cookie is installed by Google Analytics.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
AE_AB_COOKIE	1 year	No description
DEVICE_INFO	5 months 27 days	No description
loglevel	never	No description available.
tl_4829_4830_26	1 month	No description
tl_4829_4840_30	1 month	No description
tl_4829_4941_41	1 month	No description
tve_secret	1 year	No description available.

What is fault tree analysis?

Definition

Synonym(s):

Description:

Fun fact

When to use the fault tree analysis

Pros

Cons

Avoid the pitfalls of bad data quality. Here are the 4 myths about data quality everyone thinks are true.

Steps to develop it

Tips

Tools

George Firican

The 6 layers of AI governance: A practical AI governance framework

How AI Is Reinventing MDM and Data Governance

From fragmented data to planetary-scale systems: why FSA/MEBS represents a step-change in enterprise modeling

Optimizing retail operations through a practical data strategy

Transforming Marketing Data into Business Growth: Key Insights and Strategies

You may also like:

What is fault tree analysis?

How to use the barrier analysis for improved data quality

How learning to use Pareto analysis can improve your data quality