Publisher : Korean Institute of Intelligent Systems
DOI : 10.5391/JKIIS.2016.26.3.202
Title & Authors
Determination of Optimal Checkpoint Intervals for Real-Time Tasks Using Distributed Fault Detection Kwak, Seong Woo; Yang, Jung-Min;
Checkpoint placement is an effective fault tolerance technique against transient faults in which the task is re-executed from the latest checkpoint when a fault is detected. In this paper, we propose a new checkpoint placement strategy separating data saving and fault detection processes that are performed together in conventional checkpoints. Several fault detection processes are performed in one checkpoint interval in order to decrease the latency between the occurrence and detection of faults. We address the placement method of fault detection processes to maximize the probability of successful execution of a task within the given deadline. We develop the Markov chain model for a real-time task having the proposed checkpoints, and derive the optimal fault detection and checkpoint interval.
S. Punnekkat, A. Burns, and R. Davis, “Analysis of checkpointing for real-time systems,” International Journal of Time-Critical Computing Systems, Vol. 20, No. 1, pp. 83-102, 2001.
T. Ozaki, T. Dohi, H. Okamura, and N. Kaio, “Distributionfree checkpoint placement algorithms based on min-max principle,” IEEE Transactions on Dependable and Secure Computing, Vol. 3, No. 2, pp. 130-140, 2006.
J. W. Young, “A first order approximation to the optimal checkpoint intervals,” Communications of the ACM, Vol. 17, No. 9, pp. 530-531, 1974.
Y. Ling, J. Mi, and X. Lin, “A variational calculus approach to optimal checkpoint placement,” IEEE Transactions on Computers, Vol. 50, No. 7, pp. 699-708, 2001.
S. W. Kwak and Y. J. Jung, "Determination of optimal checkpoint interval for RM scheduled real-time tasks," Transactions of the Korean Institute of Electrical Engineers, vol. 56, No. 6, pp. 1122-1129, 2007.
S. W. Kwak and J.-M. Yang, “Determining checkpoint intervals of non-preemptive rate monotonic scheduling using probabilistic optimization,” Journal of Korean Institute of Intelligent Systems, Vol. 21, No. 1, pp. 120-127, 2011.
S. W. Kwak and J.-M. Yang, “Optimal checkpoint placement for real-time systems with multi-tasks having deadlines longer than periods,” Transactions of the Korean Institute of Electrical Engineers, Vol. 61, No. 1, pp. 148-154, 2012.