Course Details
Subject {L-T-P / C} : CS4440 : Fault Tolerant Systems { 3-0-0 / 3}
Subject Nature : Theory
Coordinator : Pabitra Mohan Khilar
Syllabus
Module 1 : |
Module 1: Introduction to fault tolerance, Requirement of Fault Tolerance, Goals and Characteristics of fault tolerance, Challenges for fault tolerance, Types of faults: Hard, Soft, Transient, Intermittent and Byzantine Faults, Causes of Faults: Environment, Out of range, Physical damage
|
Course Objective
1 . |
To identify the types of faults and fault behavior in distributed systems |
2 . |
To develop fault detection, diagnosis and recovery algorithms |
3 . |
To evaluate the fault tolerant systems using standard diagnosis parameters |
4 . |
To apply the fault diagnosis algorithms to different distributed systems |
Course Outcome
1 . |
Performance evaluation of fault tolerant systems
|
Essential Reading
1 . |
P. Jalote, Fault Tolerance in Distributed Systems, PHI , 1999 |
2 . |
Elena Dubrova,, Fault Tolerant Design, Springer , 2013 |
Supplementary Reading
1 . |
Thomas H & Y. Robert,, Fault Tolerance Techniques for High Performance Computing, Springer , 2015 |
2 . |
D.Janakiram, Grid Computing, TMH , 2005 |
Journal and Conferences
1 . |
P.M.Khilar and S.Mahapatra, “Time-Constrained Fault Tolerant X-by-wire Systems” International Journal of Computer and Applications, Vol. 31, No.4, Oct-Dec, 2009, pp. 231-238 |
2 . |
Sanjaya Kumar Panda and Pabitra Mohan Khilar, “A Two-Step QoS Priority for Scheduling in Grid”, Proceedings of The Second IEEE International Conference on Parallel, Distributed and Grid Computing (PDGC), IEEE, Waknaghat, 6th - 8th Dec 2012, pp. 502 – 507. |