Software Design for Resilient Computer Systems

Schagaev, Igor, Gutknecht, Jürg

  • Publisher: Springer
  • Publication date: 2024-07-16
  • List price: $6,490
  • VIP price: $6,166 (5% off)
  • Language: English
  • Pages: 445
  • Binding: Hardcover - also called cloth, retail trade, or trade
  • ISBN: 3031551389
  • ISBN-13: 9783031551383
  • Overseas special-order title (requires separate checkout)

Product Description

This book addresses the question of how system software should be designed to account for faults, and which fault tolerance features it should provide for the highest reliability. This third edition of Software Design for Resilient Computer Systems is thoroughly updated with the newest advice on software resilience. With a new introductory chapter, the new edition is well suited to researchers and industry professionals.

In the book, the authors first show how system software interacts with the hardware to tolerate faults. They analyze and further develop the theory of fault tolerance to understand the diverse ways to increase the reliability of a system, with special attention to the role of system software in this process. They introduce the theory of redundancy and its use in constructing a subsystem through the generalised algorithm of fault tolerance (GAFT), and apply it to distributed systems. The book's approach is applied to various hardware subsystems, including different structures of RAM and processor cores, and demonstrates exceptional performance, reliability, and energy efficiency. This third edition devotes substantial attention to system software for modern computers, including run-time systems, supporting recovery algorithms and their analysis, language aspects, and ways to improve reconfigurable and parallel computing.

Due to the wide-reaching nature of the content, this book applies to a host of industries and research areas, including military, aviation, intensive health care, industrial control, and space exploration.
