Fault Tolerance Mechanism for Computational Grid Using Checkpoint Algorithm
Grid computing, fault tolerance, checkpointing
Computational grids make it possible to solve large-scale scientific applications using heterogeneous, geographically distributed resources. A grid infrastructure is a large set of geographically distributed nodes connected by a communication network. Fault tolerance is one of the main research areas in computational grids. It is necessary because distribution creates a number of problems related to the heterogeneity of hardware, operating systems, networks, middleware, and applications, as well as to dynamic resources and scalability. The probability of a fault also increases as the number of resources involved in the grid increases. In this paper our main focus is the development of a fault tolerance system for computational grids. We have studied existing fault tolerance mechanisms in computational grids in detail and have ascertained the frequent causes of failures in them. A checkpoint is a designated place in a program at which normal processing is interrupted specifically to preserve the status information necessary to allow resumption of processing at a later time. Checkpointing is the process of saving that status information. For this work we set up a computational grid based on the Alchemi middleware. Alchemi is a .NET-based grid computing framework that provides the runtime machinery and programming environment required to construct a computational grid. After setting up the grid environment, we generated checkpoint results and compared them with the Chandy-Lamport results.
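The definition of a checkpoint given above can be illustrated with a minimal sketch. It is not the paper's implementation and does not use the Alchemi API. It assumes a single long-running job whose whole state fits in a small dictionary, and it is a local, single-process checkpoint rather than a coordinated distributed snapshot such as Chandy-Lamport. The names `save_checkpoint`, `load_checkpoint`, `run_job`, and the `fail_at` parameter are hypothetical and exist only for illustration.

```python
import os
import pickle
import tempfile


def save_checkpoint(path, state):
    """Persist the job state; write to a temp file, then rename atomically."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "wb") as handle:
        pickle.dump(state, handle)
    # os.replace is atomic, so a crash never leaves a half-written checkpoint.
    os.replace(tmp_path, path)


def load_checkpoint(path):
    """Return the last saved state, or None if no checkpoint exists."""
    if not os.path.exists(path):
        return None
    with open(path, "rb") as handle:
        return pickle.load(handle)


def run_job(path, n, interval=10, fail_at=None):
    """Sum the squares of 0..n-1, checkpointing every `interval` steps.

    `fail_at` (hypothetical) simulates a node failure at that step.
    """
    # Resume from the last checkpoint if one exists, else start fresh.
    state = load_checkpoint(path) or {"next": 0, "total": 0}
    for i in range(state["next"], n):
        if fail_at is not None and i == fail_at:
            raise RuntimeError("simulated node failure")
        state["total"] += i * i
        state["next"] = i + 1
        if state["next"] % interval == 0:
            save_checkpoint(path, state)
    # The job finished, so the checkpoint is no longer needed.
    if os.path.exists(path):
        os.remove(path)
    return state["total"]
```

If the job fails partway through, calling `run_job` again with the same checkpoint path resumes from the last saved step instead of restarting from zero. A shorter `interval` loses less work on failure but costs more time writing checkpoints.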
Mr. Ramesh T. Prajapati. "Fault Tolerance Mechanism for Computational Grid Using Checkpoint Algorithm". INTERNATIONAL JOURNAL OF ENGINEERING DEVELOPMENT AND RESEARCH, ISSN: 2321-9939, Vol. 1, Issue 1, pp. 54-60, URL: https://rjwave.org/ijedr/papers/IJEDR1301011.pdf
Volume 1 Issue 1
Pages 54-60