Software Design for Resilient Computer Systems

This book addresses the question of how system software should be designed to account for faults, and which fault tolerance features it should provide for highest reliability. The authors first show how the system software interacts with the hardware to tolerate faults. They analyze and further deve...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριοι συγγραφείς: Schagaev, Igor (Συγγραφέας), Kaegi-Trachsel, Thomas (Συγγραφέας)
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Cham : Springer International Publishing : Imprint: Springer, 2016.
Έκδοση:1st ed. 2016.
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
LEADER 04268nam a22005535i 4500
001 978-3-319-29465-0
003 DE-He213
005 20160216153010.0
007 cr nn 008mamaa
008 160213s2016 gw | s |||| 0|eng d
020 |a 9783319294650  |9 978-3-319-29465-0 
024 7 |a 10.1007/978-3-319-29465-0  |2 doi 
040 |d GrThAP 
050 4 |a TK1-9971 
072 7 |a TJK  |2 bicssc 
072 7 |a TEC041000  |2 bisacsh 
082 0 4 |a 621.382  |2 23 
100 1 |a Schagaev, Igor.  |e author. 
245 1 0 |a Software Design for Resilient Computer Systems  |h [electronic resource] /  |c by Igor Schagaev, Thomas Kaegi-Trachsel. 
250 |a 1st ed. 2016. 
264 1 |a Cham :  |b Springer International Publishing :  |b Imprint: Springer,  |c 2016. 
300 |a XIV, 214 p. 70 illus., 51 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
505 0 |a Introduction -- Hardware Faults -- Fault Tolerance: Theory and Concepts -- Generalized Algorithm of Fault Tolerance (GAFT) -- GAFT Generalization: A Principle and Model of Active System Safety -- System Software Support for Hardware Deficiency: Function and Features -- Testing and Checking -- Recovery Preparation -- Recovery: Searching and Monitoring of Correct Software States -- Recovery Algorithms: An Analysis -- Programming Language for Safety Critical Systems -- Proposed Runtime System Structure -- Proposed Runtime System vs. Existing Approaches -- Hardware: The ERRIC Architecture -- Architecture Comparison and Evaluation -- ERRIC Reliability. 
520 |a This book addresses the question of how system software should be designed to account for faults, and which fault tolerance features it should provide for highest reliability. The authors first show how the system software interacts with the hardware to tolerate faults. They analyze and further develop the theory of fault tolerance to understand the different ways to increase the reliability of a system, with special attention on the role of system software in this process. They further develop the general algorithm of fault tolerance (GAFT) with its three main processes: hardware checking, preparation for recovery, and the recovery procedure. For each of the three processes, they analyze the requirements and properties theoretically and give possible implementation scenarios and system software support required. Based on the theoretical results, the authors derive an Oberon-based programming language with direct support of the three processes of GAFT. In the last part of this book, they introduce a simulator, using it as a proof of concept implementation of a novel fault tolerant processor architecture (ERRIC) and its newly developed runtime system feature-wise and performance-wise. The content applies to industries such as military, aviation, intensive health care, industrial control, space exploration, etc. · Outlines potential critical faults in the modern computer systems and what is required to change them · Explains how to design and re-design system software for the next generation of computers for wider application domains and greater efficiency and reliability · Presents how implemented system software support makes maintenance of computer systems much easier, while reliability and performance increases. 
650 0 |a Engineering. 
650 0 |a Computer software  |x Reusability. 
650 0 |a Software engineering. 
650 0 |a Quality control. 
650 0 |a Reliability. 
650 0 |a Industrial safety. 
650 0 |a Electrical engineering. 
650 0 |a Electronic circuits. 
650 1 4 |a Engineering. 
650 2 4 |a Communications Engineering, Networks. 
650 2 4 |a Circuits and Systems. 
650 2 4 |a Software Engineering. 
650 2 4 |a Performance and Reliability. 
650 2 4 |a Quality Control, Reliability, Safety and Risk. 
700 1 |a Kaegi-Trachsel, Thomas.  |e author. 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9783319294636 
856 4 0 |u http://dx.doi.org/10.1007/978-3-319-29465-0  |z Full Text via HEAL-Link 
912 |a ZDB-2-ENG 
950 |a Engineering (Springer-11647)