Design and analysis of an efficient algorithm for coordinated checkpointing in distributed systems

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review

2 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings of the Conference on Advances in Parallel and Distributed Computing
PublisherIEEE
Pages261-268
Publication statusPublished - 1997

Conference

TitleProceedings of the 1997 Conference on Advances in Parallel and Distributed Computing
CityShanghai, China
Period19 - 21 March 1997

Abstract

A synchronous checkpointing algorithm coordinates a set of processes in taking checkpoints in such a way that the set of local checkpoints always forms part of a consistent global system state. Whenever a process p requests to take a checkpoint, a set of processes, called the cohorts set of p, must be checked and some of them may also have to take their checkpoints in order to preserve system consistency. Although several synchronous checkpointing algorithms have been proposed in the literature, most of them do not address the performance issue. In this paper we propose an efficient distributed algorithm for synchronous checkpointing. Proof of correctness and analysis of efficiency of the algorithm are presented. It is shown that the algorithm has a better message and time complexity than the existing algorithms. The method proposed in this paper can also be applied to enhance the performance of rollback operation which always require synchronization of the inter-dependent processes.

Citation Format(s)

Design and analysis of an efficient algorithm for coordinated checkpointing in distributed systems. / Cao, Jiannong; Jia, Weijia; Jia, Xiaohua et al.

Proceedings of the Conference on Advances in Parallel and Distributed Computing. IEEE, 1997. p. 261-268.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review