This paper provides a survey of the industry perspective on System Resiliency and Resiliency design approaches and briefly touches on Organizational Resiliency topics. Beginning with a composite definition of Resiliency, System Capabilities, Adversities, and the Resiliency Life-cycle the document then covers Operational Response Timelines, Failure Sources and Classifications. Next, Design for Resiliency is discussed with an introduction to Systems Theory and a review of Trade-off Analysis and Resiliency Dependencies. Then more than a dozen Resiliency Design Patterns are included for the reader to consider for their own solutioning. Supporting non-functional design topics including Availability, Performance, Security, Reliability as well as Reliability Allocation using Reliability Block Diagrams are also covered. Additionally, Failure Mode and Effect Analysis is reviewed, and a Resiliency Maturity Model is discussed. Finally, several Resiliency Design Examples are presented along with a set of recommendations on how to apply System Resiliency concepts and methods in an IT environment.
翻译:本文件对关于系统弹性和复原力设计方法的行业观点进行了调查,并简要介绍了组织弹性设计专题。从弹性、系统能力、脆弱性和复原力寿命周期的综合定义开始,本文件随后涵盖业务反应时间线、故障源和分类。接着,对弹性设计进行讨论,同时介绍系统理论,并审查交易分析和复原力依赖性。然后,列入十多个弹性设计模式,供读者考虑自行解决问题。还涵盖非功能设计专题,包括可用性、性能、安全性、可靠性以及使用可靠性区块图的可靠性分配。此外,还审查了故障模式和效果分析,并讨论了复原力成熟性模型。最后,提出了若干弹性设计实例,并提出了关于如何在信息技术环境中应用系统弹性概念和方法的一套建议。