Presentation 2017-03-02
Failure Detection and Monitoring Method for High Availability Distributed Cluster
Tomohiro Ono, Kiyoshi Ueda
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Session control servers in carrier networks are required to provide high availability, high reliability, and fault tolerance. They are also required to support a small start and to scale, and to change configuration dynamically according to traffic demand while continuing service. We have studied a scale-out session control server architecture that can control system performance flexibly by adding and removing servers without stopping service. In this study, to further improve the availability and reliability of the clusters, we propose techniques for detecting silent failures by having the servers that make up a cluster monitor each other. In addition, we propose techniques for reducing the network load of monitoring and for analyzing the causes of failures. We evaluated some of the proposed techniques experimentally and confirmed their feasibility and efficacy.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) High Availability Distributed Cluster / Failure Detection / Heartbeat
Paper # NS2016-181
Date of Issue 2017-02-23 (NS)

Conference Information
Committee NS / IN
Conference Date 2017/3/2(2days)
Place (in Japanese) (See Japanese page)
Place (in English) OKINAWA ZANPAMISAKI ROYAL HOTEL
Topics (in Japanese) (See Japanese page)
Topics (in English) General
Chair Hideki Tode(Osaka Pref. Univ.) / Katsunori Yamaoka(Tokyo Inst. of Tech.)
Vice Chair Yoshikatsu Okazaki(NTT) / Takuji Kishida(NTT)
Secretary Yoshikatsu Okazaki(Kyushu Inst. of Tech.) / Takuji Kishida(NTT)
Assistant Shohei Kamamura(NTT) / Kunitake Kaneko(Keio Univ.) / Takashi Natsume(NTT)

Paper Information
Registration To Technical Committee on Network Systems / Technical Committee on Information Networks
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Failure Detection and Monitoring Method for High Availability Distributed Cluster
Sub Title (in English)
Keyword(1) High Availability Distributed Cluster
Keyword(2) Failure Detection
Keyword(3) Heartbeat
1st Author's Name Tomohiro Ono
1st Author's Affiliation Nihon University(Nihon Univ.)
2nd Author's Name Kiyoshi Ueda
2nd Author's Affiliation Nihon University(Nihon Univ.)
Date 2017-03-02
Paper # NS2016-181
Volume (vol) vol.116
Number (no) NS-484
Page pp.137-142(NS)
#Pages 6
Date of Issue 2017-02-23 (NS)