SI-13: Predictable Failure Prevention

Baselines:

  • Low

    N/A

  • Moderate

    N/A

  • High

    N/A

  • Privacy

    N/A

Control Statement

  1. Determine mean time to failure (MTTF) for the following system components in specific environments of operation: [Assignment: organization-defined system components]; and
  2. Provide substitute system components and a means to exchange active and standby components in accordance with the following criteria: [Assignment: organization-defined MTTF substitution criteria].

Supplemental Guidance

While MTTF is primarily a reliability issue, predictable failure prevention is intended to address potential failures of system components that provide security capabilities. Failure rates reflect installation-specific considerations rather than industry averages. Organizations define the criteria for the substitution of system components based on the MTTF value, with consideration for the potential harm from component failures. The transfer of responsibilities between active and standby components does not compromise safety, operational readiness, or security capabilities. The preservation of system state variables is also critical to help ensure a successful transfer process. Standby components remain available at all times except for maintenance issues or recovery failures in progress.
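As an illustration of deriving MTTF from installation-specific data rather than an industry figure, the following minimal sketch computes a point estimate as total operating hours divided by observed failures. The estimator, the ComponentHistory record, and the sample numbers are assumptions made for this example; the control does not prescribe a calculation method.

```python
from dataclasses import dataclass


@dataclass
class ComponentHistory:
    """Hypothetical operating record for one component type at one installation."""
    name: str
    operating_hours: float  # total hours accumulated across all units
    failures: int           # failures observed during those hours


def estimate_mttf(history: ComponentHistory) -> float:
    """Point estimate of MTTF: total operating time divided by failure count."""
    if history.failures <= 0:
        # With no observed failures this simple estimator is undefined.
        raise ValueError("no failures observed; MTTF cannot be estimated")
    return history.operating_hours / history.failures


# Illustrative numbers only, not taken from any real installation.
psu = ComponentHistory(name="edge-firewall-psu", operating_hours=120_000.0, failures=4)
print(f"{psu.name}: estimated MTTF = {estimate_mttf(psu):.0f} h")
```

An organization with few observed failures would likely need a more careful statistical treatment than this simple ratio.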

Control Enhancements

SI-13(1): Transferring Component Responsibilities

Baseline(s):

(Not part of any baseline)

Take system components out of service by transferring component responsibilities to substitute components no later than [Assignment: organization-defined fraction or percentage] of mean time to failure.
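The threshold in this enhancement reduces to simple arithmetic. The sketch below is one hedged way to express it: the function names are hypothetical, and the restriction of the fraction to the range (0, 1] is an assumption of this example, not a requirement stated in the control.

```python
def transfer_deadline_hours(mttf_hours: float, fraction: float) -> float:
    """Hours of service after which responsibilities should already be transferred."""
    if mttf_hours <= 0:
        raise ValueError("MTTF must be positive")
    if not 0 < fraction <= 1:
        # Assumption for this sketch: the organization-defined value is a
        # fraction of MTTF no greater than 1.
        raise ValueError("fraction must be in (0, 1]")
    return mttf_hours * fraction


def must_transfer(hours_in_service: float, mttf_hours: float, fraction: float) -> bool:
    """True once the component has reached its organization-defined share of MTTF."""
    return hours_in_service >= transfer_deadline_hours(mttf_hours, fraction)


# Illustrative values: MTTF of 30,000 h and a 75% threshold.
print(transfer_deadline_hours(30_000.0, 0.75))   # 22500.0
print(must_transfer(21_000.0, 30_000.0, 0.75))   # False
print(must_transfer(22_500.0, 30_000.0, 0.75))   # True
```

A percentage-based assignment maps onto the same check by dividing the percentage by 100 before the call.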

SI-13(3): Manual Transfer Between Components

Baseline(s):

(Not part of any baseline)

Manually initiate transfers between active and standby system components when the use of the active component reaches [Assignment: organization-defined percentage] of the mean time to failure.

SI-13(4): Standby Component Installation and Notification

Baseline(s):

(Not part of any baseline)

If system component failures are detected:

  1. Ensure that the standby components are successfully and transparently installed within [Assignment: organization-defined time period]; and
  2. [Selection (one or more): Activate [Assignment: organization-defined alarm]; Automatically shut down the system; [Assignment: organization-defined action]].
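The two parts of this enhancement can be sketched as a single failure handler: install the standby component, measure whether installation met the time limit, and run whichever responses the organization selected. Everything named here (handle_component_failure, the callables passed in, the injectable clock) is hypothetical scaffolding for illustration, not an interface defined by the control.

```python
import time
from typing import Callable, Sequence


def handle_component_failure(
    install_standby: Callable[[], bool],
    responses: Sequence[Callable[[], None]],
    time_limit_s: float,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Install the standby, run the selected responses, and report whether
    installation succeeded within the organization-defined time period."""
    started = clock()
    installed = install_standby()
    elapsed = clock() - started

    # Selected responses (alarm, shutdown, other action) run on every detected
    # failure, regardless of whether the installation met the limit.
    for respond in responses:
        respond()

    return installed and elapsed <= time_limit_s


# Usage with stand-in callables; a real system would call its own
# failover and alerting mechanisms here.
log = []
ok = handle_component_failure(
    install_standby=lambda: True,
    responses=[lambda: log.append("alarm activated")],
    time_limit_s=5.0,
)
print(ok, log)
```

Passing the clock as a parameter keeps the timing check testable without waiting on real time.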

SI-13(5): Failover Capability

Baseline(s):

(Not part of any baseline)

Provide [Selection: real-time; near real-time] [Assignment: organization-defined failover capability] for the system.