Presentation 2006-06-15
Classification method of tree data reflecting structual similarity and contents similarity
Hiroaki SAITO, Hisashi KOGA, Toshinori WATANABE, Takanori YOKOYAMA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Tree is useful for representing various objects such as semi-structured data and gene data. Thus, computing tree similarity is important in the research area of pattern recognition and information retrieval. Tree edit distance is one of the most known dissimilarity measures for trees and defined as the minimum value of the total costs associated with node edit operations (i.e, deletion, insertion and relabeling) incurred in the conversion between two trees. Tree edit distance contains both tree structural dissimilarity and label contents dissimilarity. However, the significance of the two dissimilarity depends on the application and the data. Therefore, in this paper, we propose a new method which can get a clustering result which reflects the characteristics of the target application and the users' purpose properly by splitting the tree edit distance into contents dissimilarity and structural dissimilarity.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Tree edit distance / Tree structural similarity / Contents similarity / Clustering / XML
Paper # DE2006-2,PRMU2006-40
Date of Issue

Conference Information
Committee DE
Conference Date 2006/6/8(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Classification method of tree data reflecting structual similarity and contents similarity
Sub Title (in English)
Keyword(1) Tree edit distance
Keyword(2) Tree structural similarity
Keyword(3) Contents similarity
Keyword(4) Clustering
Keyword(5) XML
1st Author's Name Hiroaki SAITO
1st Author's Affiliation Graduate School of Information Systems, University of Electro-Communications()
2nd Author's Name Hisashi KOGA
2nd Author's Affiliation Graduate School of Information Systems, University of Electro-Communications
3rd Author's Name Toshinori WATANABE
3rd Author's Affiliation Graduate School of Information Systems, University of Electro-Communications
4th Author's Name Takanori YOKOYAMA
4th Author's Affiliation Graduate School of Information Systems, University of Electro-Communications
Date 2006-06-15
Paper # DE2006-2,PRMU2006-40
Volume (vol) vol.106
Number (no) 97
Page pp.pp.-
#Pages 6
Date of Issue