Presentation 2013-08-09
The NICT Science Cloud : A Challenge of Science Data Crawling via NICT Science Cloud and Discussion toward Linked Open Data
Ken T. MURATA, Hidenobu WATANABE, Kentaro UKAWA, Kazunori YAMAMOTO, Koji ZETTSU,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In the Solar-Terrestrial Physics (STP), it is pointed out that circulation and utilization of observation data among researchers are insufficient. One of the reasons is that the data formats of STP observation data are not common. This is not only the issue of STP data but also of other natural science data. To archive interdisciplinary researches, we need to overcome this circulation and utilization problems. Under such a background, the Solar-Terrestrial data Analysis and Reference System (STARS) has been designed and developed by the authors' group. The STARS has its own database that manages meta-data of satellite and ground-based observation data files. The STARS provides users with cross-over data file search services and download services over the Internet. It is noted that retrieving meta-data from the observation data and registering them to database have been carried out by hand so far in the STARS. It is hard to deal with a huge amount of observation data due to the lack of manpower. We developed an automatic meta-data collection system for the observation data using the STARS RSS (RDF Site Summary) 1.0. The RSS1.0 is one of the XML-based markup languages based on the RDF (Resource Description Framework), which is designed for syndicating news and content of news-like sites. Using the RSS1.0 as a meta-data distribution method, the workflow from retrieving meta-data to registering them into the database is automated. This technique was applied for the DARTS (Data Archive and Transmission System), which is a science database managed by the PLAIN Center at ISAS/JAXA in Japan. We succeeded in generating and collecting the meta-data automatically. Our final goal is to establish the STARS Semantic Web. The Semantic Web provides a common framework that allows data to be shared and reused across applications, enterprises, and communities. The most fundamental issue on the establishment is who manages meta-data in the Semantic Web. In the present study, we designed meta-data of the STARS along with the RSS1.0 document. In order to describe the meta-data of the STARS beyond RSS1.0 vocabulary, we defined original vocabularies for the STARS resources using RDF Schema. Our system works as follows. The RSS1.0 documents generated on data sites are automatically collected by a meta-data collection agent. The agent extracts meta-data to store them in an XML database. The XML database provides advanced retrieval processing that has considered property and relation. In the future, we develop a RDF database supported by inference engine, which leads to automatic processing or high level search for the data which are not only for observation data but for news and event information related to the STP.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Semantic Web / Space Weather / RSS1.0 / RDF / Ontology / Science Cloud / Linked Open Data
Paper # AI2013-18,SC2013-12
Date of Issue

Conference Information
Committee AI
Conference Date 2013/8/2(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) The NICT Science Cloud : A Challenge of Science Data Crawling via NICT Science Cloud and Discussion toward Linked Open Data
Sub Title (in English)
Keyword(1) Semantic Web
Keyword(2) Space Weather
Keyword(3) RSS1.0
Keyword(4) RDF
Keyword(5) Ontology
Keyword(6) Science Cloud
Keyword(7) Linked Open Data
1st Author's Name Ken T. MURATA
1st Author's Affiliation National Institute of Information and Communications Technology()
2nd Author's Name Hidenobu WATANABE
2nd Author's Affiliation National Institute of Information and Communications Technology
3rd Author's Name Kentaro UKAWA
3rd Author's Affiliation Systems Engineering Consultants Co., LTD
4th Author's Name Kazunori YAMAMOTO
4th Author's Affiliation National Institute of Information and Communications Technology
5th Author's Name Koji ZETTSU
5th Author's Affiliation National Institute of Information and Communications Technology
Date 2013-08-09
Paper # AI2013-18,SC2013-12
Volume (vol) vol.113
Number (no) 178
Page pp.pp.-
#Pages 6
Date of Issue