SI-13: Predictable Failure Prevention
- NIST Special Publication 800-53 Revision 4:
- SI-13: Predictable Failure Prevention
- Determine mean time to failure (MTTF) for the following system components in specific environments of operation: [Assignment: organization-defined system components]; and
- Provide substitute system components and a means to exchange active and standby components in accordance with the following criteria: [Assignment: organization-defined MTTF substitution criteria].
While MTTF is primarily a reliability issue, predictable failure prevention is intended to address potential failures of system components that provide security capabilities. Failure rates reflect installation-specific considerations rather than the industry average. Organizations define the criteria for the substitution of system components based on the MTTF value with consideration for the potential harm from component failures. The transfer of responsibilities between active and standby components does not compromise safety, operational readiness, or security capabilities. The preservation of system state variables is also critical to help ensure a successful transfer process. Standby components remain available at all times except for maintenance issues or recovery failures in progress.
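Because the failure rate is meant to come from the installation itself, one simple way to obtain an MTTF figure is to divide observed operating time by observed failures. The following is a minimal sketch under that assumption; the `ComponentHistory` record, the function name, and the sample numbers are hypothetical and not part of the control text.

```python
from dataclasses import dataclass


@dataclass
class ComponentHistory:
    """Field data for one component type at one installation (hypothetical record)."""
    operating_hours: float  # total hours in service, summed over all units
    failures: int           # failures observed during those hours


def estimate_mttf(history: ComponentHistory) -> float:
    """Point estimate of MTTF in hours: total operating time / failure count."""
    if history.failures <= 0:
        # With no failures on record, this simple estimator is undefined.
        raise ValueError("no observed failures; MTTF cannot be estimated")
    return history.operating_hours / history.failures


# Illustrative numbers only: 120,000 unit-hours with 4 failures.
firewall_psu = ComponentHistory(operating_hours=120_000.0, failures=4)
print(estimate_mttf(firewall_psu))  # 30000.0
```

An organization with little field data would likely need a more careful statistical treatment than this point estimate.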
SI-13(1): Transferring Component Responsibilities
Take system components out of service by transferring component responsibilities to substitute components no later than [Assignment: organization-defined fraction or percentage] of mean time to failure.
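As an illustration, the latest permissible transfer time can be computed from the date a component entered service, its MTTF, and the organization-defined fraction. This sketch assumes MTTF is tracked in calendar hours since installation; the function names and parameters are hypothetical.

```python
from datetime import datetime, timedelta


def transfer_deadline(in_service_since: datetime,
                      mttf_hours: float,
                      fraction: float) -> datetime:
    """Latest time by which responsibilities should move to the substitute."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    return in_service_since + timedelta(hours=mttf_hours * fraction)


def transfer_due(now: datetime,
                 in_service_since: datetime,
                 mttf_hours: float,
                 fraction: float) -> bool:
    """True once the deadline has been reached or passed."""
    return now >= transfer_deadline(in_service_since, mttf_hours, fraction)


# Illustrative values: MTTF of 30,000 hours, transfer at 80% of MTTF.
deadline = transfer_deadline(datetime(2024, 1, 1), 30_000.0, 0.8)
print(deadline.isoformat())
```

A scheduler could evaluate `transfer_due` periodically and start the transfer before the deadline rather than at it.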
SI-13(3): Manual Transfer Between Components
Manually initiate transfers between active and standby system components when the use of the active component reaches [Assignment: organization-defined percentage] of the mean time to failure.
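Since the transfer here is started by a person, tooling would typically only flag the condition. The sketch below assumes use is metered in accumulated hours and returns a message for an operator once the threshold percentage is reached; the function name, component identifier, and message wording are hypothetical.

```python
from typing import Optional


def manual_transfer_prompt(component_id: str,
                           hours_used: float,
                           mttf_hours: float,
                           threshold_pct: float) -> Optional[str]:
    """Return an operator prompt once use reaches threshold_pct of MTTF, else None."""
    if mttf_hours <= 0:
        raise ValueError("mttf_hours must be positive")
    used_pct = 100.0 * hours_used / mttf_hours
    if used_pct >= threshold_pct:
        return (f"{component_id}: {used_pct:.1f}% of MTTF used; "
                "operator should initiate transfer to standby")
    return None  # below threshold, nothing to do yet


# Illustrative values: 27,000 of 30,000 hours used, threshold at 85%.
print(manual_transfer_prompt("hsm-1", 27_000.0, 30_000.0, 85.0))
```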
SI-13(4): Standby Component Installation and Notification
If system component failures are detected: (a) Ensure that the standby components are successfully and transparently installed within [Assignment: organization-defined time period]; and (b) [Selection (one or more): Activate [Assignment: organization-defined alarm]; Automatically shut down the system; [Assignment: organization-defined action]].
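A failure handler for this enhancement has two parts: installing the standby and checking that it finished within the defined time period, then carrying out the chosen responses (alarm, shutdown, or another organization-defined action). The sketch below is one possible shape for that logic. All names are hypothetical, and it measures the installation time after the fact rather than enforcing a hard timeout.

```python
import time
from typing import Callable, Iterable


def handle_component_failure(install_standby: Callable[[], bool],
                             max_install_seconds: float,
                             actions: Iterable[Callable[[], None]],
                             clock: Callable[[], float] = time.monotonic) -> bool:
    """Install the standby, run the configured responses, and report
    whether installation succeeded within the allowed time period."""
    start = clock()
    installed = install_standby()      # True if the standby took over
    elapsed = clock() - start
    within_limit = installed and elapsed <= max_install_seconds

    # Run every configured response (e.g. raise an alarm, shut down).
    for action in actions:
        action()
    return within_limit


# Illustrative wiring with stand-in callables.
events = []
ok = handle_component_failure(
    install_standby=lambda: True,
    max_install_seconds=30.0,
    actions=[lambda: events.append("alarm")],
)
print(ok, events)
```

Passing `clock` as a parameter is a design choice that lets the timing check be exercised with a fake clock.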
SI-13(5): Failover Capability
Provide [Selection: real-time; near real-time] [Assignment: organization-defined failover capability] for the system.
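To make the idea concrete, the following is a minimal in-process sketch of failover between an active and a standby component that share preserved state variables. It assumes components are plain callables taking a request and a state dictionary; the class and function names are hypothetical, and a production failover mechanism would also need health checks and replication across hosts.

```python
from typing import Any, Callable, Dict

Component = Callable[[str, Dict[str, Any]], str]


class FailoverPair:
    """Routes requests to the active component and swaps to the standby on error."""

    def __init__(self, active: Component, standby: Component) -> None:
        self.active = active
        self.standby = standby
        # State variables kept outside either component so they survive a transfer.
        self.state: Dict[str, Any] = {}

    def call(self, request: str) -> str:
        try:
            return self.active(request, self.state)
        except Exception:
            # Active component failed: swap roles and retry once on the standby.
            self.active, self.standby = self.standby, self.active
            return self.active(request, self.state)


def flaky_primary(request: str, state: Dict[str, Any]) -> str:
    raise RuntimeError("primary down")  # simulated failure


def backup(request: str, state: Dict[str, Any]) -> str:
    state["handled"] = state.get("handled", 0) + 1
    return f"backup handled {request}"


pair = FailoverPair(flaky_primary, backup)
print(pair.call("auth-check"))  # backup handled auth-check
```

After the first failure the backup stays active, so later requests no longer pay the cost of the failed attempt.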