Presentation | 2005-10-17 Optimal Checkpoint Interval for Hybrid State Saving with Bound Rollbacks Mamoru OHARA, Masayuki ARAI, Satoshi FUKUMOTO, Kazuhiko IWASAKI, |
---|---|
PDF Download Page | ![]() |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper discusses distributed periodic checkpointing for practical applications running with limited resources. We present a discrete time model evaluating the total expected overhead imposed on every event execution where the number of available checkpoints that each process can hold is finite and the rollback distance is bound to some finite interval. The recovery overhead for the checkpointing scheme is described by using a truncated geometric distribution as the rollback distance distribution. Although it is difficult to analytically derive the optimal checkpoint interval, which minimizes the total expected overhead, substituting other simple probabilistic distributions instead of the truncated geometric distribution enables us to do this explicitly. Numerical examples obtained through simulations are presented to show that we can achieve almost the minimized total overhead by using the new models and analyses. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | reliability / distributed systems / uncoordinated checkpointing / probability models / trapezoidal distribution |
Paper # | DE2005-127,DC2005-21 |
Date of Issue |
Conference Information | |
Committee | DC |
---|---|
Conference Date | 2005/10/10(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Dependable Computing (DC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Optimal Checkpoint Interval for Hybrid State Saving with Bound Rollbacks |
Sub Title (in English) | |
Keyword(1) | reliability |
Keyword(2) | distributed systems |
Keyword(3) | uncoordinated checkpointing |
Keyword(4) | probability models |
Keyword(5) | trapezoidal distribution |
1st Author's Name | Mamoru OHARA |
1st Author's Affiliation | Graduate School of Engineering, Tokyo Metropolitan University() |
2nd Author's Name | Masayuki ARAI |
2nd Author's Affiliation | Graduate School of Engineering, Tokyo Metropolitan University |
3rd Author's Name | Satoshi FUKUMOTO |
3rd Author's Affiliation | Graduate School of Engineering, Tokyo Metropolitan University |
4th Author's Name | Kazuhiko IWASAKI |
4th Author's Affiliation | Graduate School of Engineering, Tokyo Metropolitan University |
Date | 2005-10-17 |
Paper # | DE2005-127,DC2005-21 |
Volume (vol) | vol.105 |
Number (no) | 339 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |