Reliability Centered Maintenance
Reliability Centered Maintenance (RCM) is a systematic process for determining the most effective and efficient maintenance strategy for each asset. Originally developed for the aviation industry, RCM analyzes what each asset does, how it can fail, the consequences of failure, and what maintenance tasks are applicable and effective - resulting in a tailored strategy that balances reliability, safety, and cost.
Key Points
- Systematic framework for selecting the right maintenance strategy per asset
- Focuses on preserving system function rather than maintaining equipment for its own sake
- Originated in aviation and is now applied across all asset-intensive industries
- Evaluates failure modes, consequences, and the effectiveness of each maintenance option
- Typically results in a mix of PM, PdM, RTF, and redesign tasks
The 7 RCM Questions
1. Functions: What are the functions and performance standards of the asset in its current operating context?
2. Functional Failures: In what ways can the asset fail to fulfill its functions?
3. Failure Modes: What causes each functional failure?
4. Failure Effects: What happens when each failure mode occurs?
5. Failure Consequences: In what way does each failure matter (safety, environmental, operational, economic)?
6. Proactive Tasks: What can be done to predict or prevent each failure mode?
7. Default Actions: What should be done if no proactive task is applicable or effective?
The RCM Analysis Process
An RCM analysis is performed by a cross-functional team that includes operations, maintenance, and engineering personnel who understand the asset in its operating context. The team works through the 7 questions systematically for each asset or system, documenting functions, failure modes, effects, and consequences. For each failure mode, the team evaluates whether a proactive maintenance task is technically feasible and worth doing based on the consequences. The result is a documented maintenance strategy for each failure mode - preventive, predictive, detective, run-to-failure, or redesign.
Failure Modes and Effects
- Failure mode: The specific mechanism by which an asset loses its ability to perform a function (e.g., bearing seizure, corrosion, fatigue cracking)
- Failure effect: What physically happens when the failure mode occurs, including evidence the operator would notice
- Hidden failures: Failure modes that are not evident under normal operating conditions, typically in protective or standby systems
- Consequence categories: Safety and environmental, operational (affects output), and non-operational (repair cost only)
- Age-related vs random: RCM analysis often reveals that most failure modes are not age-related, challenging traditional time-based PM approaches
Maintenance Strategy Selection
- Condition-based (predictive): Selected when a detectable degradation pattern exists and provides adequate warning time
- Scheduled restoration or replacement: Selected when there is a defined age at which reliability decreases significantly
- Failure-finding tasks: Scheduled inspections to detect hidden failures in protective devices and standby systems
- Run to failure: Selected when no proactive task is applicable, effective, or justified by the consequences
- Redesign: Recommended when failure consequences are severe and no maintenance task can adequately manage the risk
Build a Reliability Program with AssetLab
AssetLab provides the tools you need to put these concepts into practice with Canadian data residency and CAD pricing.