Presentation 2018-06-15
Study of Efficient Data Management for Data Lakes
Keisuke Hatasaki
Abstract(in Japanese) (See Japanese page)
Abstract(in English) IoT enables business transformation in many enterprise fields. Because most IoT use cases are developed through an exploratory data analysis process, large amounts of data from things and devices must be stored in Data Lakes, which incurs a large cost. In this study, we developed an efficient data management method based on the characteristics of the data stored in Data Lakes. By calculating an indicator from the data and storing only the data selected by that indicator in the Data Lake, we found that the analysis results were nearly the same even when using only 1/5 of the data in our experimental analysis case.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) IoT / Data Management / Data Lake / Machine Learning
Paper # CPSY2018-7,DC2018-7
Date of Issue 2018-06-07 (CPSY, DC)

Conference Information
Committee CPSY / DC / IPSJ-ARC
Conference Date 2018/6/14(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Takamiya Rurikura Resort
Topics (in Japanese) (See Japanese page)
Topics (in English) Dependable Computing Systems, etc. (HotSPA2018)
Chair Koji Nakano(Hiroshima Univ.) / Satoshi Fukumoto(Tokyo Metropolitan Univ.) / Masahiro Goshima(NII)
Vice Chair Hidetsugu Irie(Univ. of Tokyo) / Takashi Miyoshi(Fujitsu) / Hiroshi Takahashi(Ehime Univ.)
Secretary Hidetsugu Irie(Utsunomiya Univ.) / Takashi Miyoshi(Hokkaido Univ.) / Hiroshi Takahashi(Tokyo Inst. of Tech.) / (Nihon Univ.)
Assistant Yasuaki Ito(Hiroshima Univ.) / Tomoaki Tsumura(Nagoya Inst. of Tech.)

Paper Information
Registration To Technical Committee on Computer Systems / Technical Committee on Dependable Computing / Special Interest Group on System Architecture
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Study of Efficient Data Management for Data Lakes
Sub Title (in English)
Keyword(1) IoT
Keyword(2) Data Management
Keyword(3) Data Lake
Keyword(4) Machine Learning
1st Author's Name Keisuke Hatasaki
1st Author's Affiliation Hitachi, Ltd.(Hitachi)
Date 2018-06-15
Paper # CPSY2018-7,DC2018-7
Volume (vol) vol.118
Number (no) CPSY-92,DC-93
Page pp.113-117(CPSY), pp.113-117(DC)
#Pages 5
Date of Issue 2018-06-07 (CPSY, DC)