Course Details
Subject {L-T-P / C} : CS6121 : Fault Tolerant Distributed System { 3-0-0 / 3}
Subject Nature : Theory
Coordinator : Pabitra Mohan Khilar
Syllabus
Module 1 : |
Module 1: Introduction: High Performance Computing (HPC), Grand Challenge Problems Computational and communication intensive, Parallel Architectures Classifications SMP,MPP,NUMA,Clusters and Components of a Parallel Machine, Conventional Supercomputers and it’s limitations, Multi-processor and Multi Computer based Distributed Systems, Introduction to Clusters and Grids.
|
Course Objective
1 . |
To understand the requirements for fault tolerant distributed computing systems |
2 . |
To design and develop efficient fault tolerance algorithms for disitributed computing systems |
3 . |
To identify the fault tolerance measures for evaluating the performance of fault tolerant algorithms |
4 . |
To understand the behavior of fault tolrant systems using standard fault tolerance metrics |
Course Outcome
1 . |
Designing and implementing distributed fault tolerant systems.
|
Essential Reading
1 . |
P. Jalote, Fault Tolerance in Distributed Systems, Prentice Hall , 1994 |
2 . |
J. Joseph & C. Fellenstein,, Grid Computing, Pearson Education , 2004 |
Supplementary Reading
1 . |
H. Attiya and J. Welch, Distributed Computing: Fundamentals, Wiley , 2004 |
2 . |
G. Coulororis, J. Dollimore, and T. Kindberg., Distributed Systems: Concepts and Design., Addison Wesley , 2001 |
Journal and Conferences
1 . |
P.M.Khilar and S.Mahapatra, “Time-Constrained Fault Tolerant X-by-wire Systems” International Journal of Computer and Applications, Vol. 31, No.4, Oct-Dec, 2009, pp. 231-238 |
2 . |
A.Mahapatra and P.M.Khilar, Fault Diagnosis in Wireless Sensor Networks: A Survey, IEEE Communications Surveys and Tutorials, Issue 99, pp. 1-27, April 2013 |