Presentation 2006/7/25
The Proposal and Evaluation of Cuckoo FTMPI : Framework of Fault/Recovery model aware Component-based FTMPI
Hideyuki JITSUMOTO, Satoshi MATSUOKA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Execution of MPI applications on clusters and Grid deployments suffering from node and network failures motivates the use of fault tolerant MPI implementations. Therefore, some fault tolerant MPI was implemented. But, these fault tolerant MPI implementations cannot choose easily appropriate restoration according to the environment. We present Cuckoo FTMPI: Fault/Recovery model aware component framework. Users can get a MPI implementation according to their executing environment by the selection of the components. This paper presents the architecture of Cuckoo FTMPI, its theoretical foundation and the performance of the implementation. Preliminary evaluation using NPB, there's no overhead to use Cuckoo FTMPI on MPICH. And we presented validity of Fault/Recovery model aware component framework.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Fault Tolerant MPI / Component Framework / Recovery-model-aware
Paper # DC2006-25
Date of Issue

Conference Information
Committee DC
Conference Date 2006/7/25(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Dependable Computing (DC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) The Proposal and Evaluation of Cuckoo FTMPI : Framework of Fault/Recovery model aware Component-based FTMPI
Sub Title (in English)
Keyword(1) Fault Tolerant MPI
Keyword(2) Component Framework
Keyword(3) Recovery-model-aware
1st Author's Name Hideyuki JITSUMOTO
1st Author's Affiliation Tokyo Institute of Technology()
2nd Author's Name Satoshi MATSUOKA
2nd Author's Affiliation Tokyo Institute of Technology:National Institute of Informatics
Date 2006/7/25
Paper # DC2006-25
Volume (vol) vol.106
Number (no) 198
Page pp.pp.-
#Pages 6
Date of Issue