
Data sheet for a subject within a degree programme

13026 Reliability and Fault Tolerance - Computational Sciences Engineering


Center
School of Engineering
Department
Computer Science
Lecturers in charge
No data loaded
Teaching methodology
The exam has two parts, one on theory and one on practical work. The practical part counts for 20% of the mark and the theory part for 80%.
Assessment methodology
Final examination
Bibliography
[Johnson89] Barry W. Johnson
"Design and Analysis of Fault Tolerant Digital Systems"
Addison-Wesley, 1989.
ISBN: 0-201-07570-9.

[Pradhan96] Dhiraj K. Pradhan
"Fault Tolerant Computer System Design"
Prentice Hall PTR, 1996.
ISBN: 0-13-057887-8.

[Jalote94] Pankaj Jalote
"Fault Tolerance in Distributed Systems"
Prentice Hall, 1994.
ISBN: 0-13-301367-7.

[Sahner92] Robin A. Sahner, Kishor S. Trivedi, A. Puliafito
"Performance and Reliability Analysis of Computer Systems"
Kluwer Academic Publishers, 1992.

[Lyu95] Michael R. Lyu
"Software Fault Tolerance"
John Wiley & Sons, 1995.
ISBN: 0-471-95068-8.

[Siewio92] Daniel P. Siewiorek and Robert S. Swarz
"Reliable Computer Systems. Design and Evaluation"

Contents
This optional module focuses on a special type of computer
architecture: fault-tolerant systems. These machines are used in critical applications where a failure can create serious risks, whether economic losses or loss of human life.
During the term, the most important characteristics of these systems are studied,
covering both the hardware aspects and the software techniques that
allow them to offer certain guarantees of resistance
to failures. The module also aims for students to learn the terminology
used in this field, so the characteristic parameters
that define fault-tolerant systems are studied. The module covers
high-reliability systems, systems designed for safe operation and
high-availability systems. Students should understand the objectives of evaluating fault-tolerant systems, and the role and usefulness of models based on Markov chains, which represent the evolution of a system over time as failures occur.
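
As an illustration of the kind of Markov model mentioned above, the following minimal sketch assumes a system of two identical units in parallel, no repair, and a constant failure rate per unit. These assumptions, and the function names, are chosen for the example and are not taken from the course material. The states are "both units working", "one unit working" and "system failed"; the code steps the state probabilities forward in small time increments and compares the result with the closed-form reliability 2e^(-λt) - e^(-2λt) for this particular model.

```python
import math


def duplex_reliability(failure_rate, t_end, steps=10_000):
    """Approximate R(t_end) for two parallel units without repair (illustrative model)."""
    dt = t_end / steps
    # State probabilities: both units working, one unit working, system failed.
    p2, p1, p0 = 1.0, 0.0, 0.0
    for _ in range(steps):
        # Probability flowing out of each working state during one small step dt.
        leave_2 = 2 * failure_rate * dt * p2  # either of the two units fails
        leave_1 = failure_rate * dt * p1      # the remaining unit fails
        p2 -= leave_2
        p1 += leave_2 - leave_1
        p0 += leave_1
    # The system works as long as at least one unit works.
    return p2 + p1


def duplex_reliability_exact(failure_rate, t):
    """Closed-form reliability of the same two-unit parallel model."""
    return 2 * math.exp(-failure_rate * t) - math.exp(-2 * failure_rate * t)


if __name__ == "__main__":
    lam, t = 1e-3, 1000.0  # hypothetical values: failures per hour, hours
    print(duplex_reliability(lam, t), duplex_reliability_exact(lam, t))
```

With these hypothetical values the redundant pair is noticeably more reliable than a single unit, whose reliability under the same assumptions would be e^(-λt).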

Students are introduced to fault-tolerance techniques implemented in software, given the tendency of system designers to use this approach. Students should understand why distributed and multiprocessor systems with
functional degradation are increasingly used to achieve high availability and a strong guarantee of correct operation.
Lastly, a basic objective is to know the functioning and application environ