Presentation | 2006-06-15 Classification method of tree data reflecting structual similarity and contents similarity Hiroaki SAITO, Hisashi KOGA, Toshinori WATANABE, Takanori YOKOYAMA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Tree is useful for representing various objects such as semi-structured data and gene data. Thus, computing tree similarity is important in the research area of pattern recognition and information retrieval. Tree edit distance is one of the most known dissimilarity measures for trees and defined as the minimum value of the total costs associated with node edit operations (i.e, deletion, insertion and relabeling) incurred in the conversion between two trees. Tree edit distance contains both tree structural dissimilarity and label contents dissimilarity. However, the significance of the two dissimilarity depends on the application and the data. Therefore, in this paper, we propose a new method which can get a clustering result which reflects the characteristics of the target application and the users' purpose properly by splitting the tree edit distance into contents dissimilarity and structural dissimilarity. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Tree edit distance / Tree structural similarity / Contents similarity / Clustering / XML |
Paper # | DE2006-2,PRMU2006-40 |
Date of Issue |
Conference Information | |
Committee | DE |
---|---|
Conference Date | 2006/6/8(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Data Engineering (DE) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Classification method of tree data reflecting structual similarity and contents similarity |
Sub Title (in English) | |
Keyword(1) | Tree edit distance |
Keyword(2) | Tree structural similarity |
Keyword(3) | Contents similarity |
Keyword(4) | Clustering |
Keyword(5) | XML |
1st Author's Name | Hiroaki SAITO |
1st Author's Affiliation | Graduate School of Information Systems, University of Electro-Communications() |
2nd Author's Name | Hisashi KOGA |
2nd Author's Affiliation | Graduate School of Information Systems, University of Electro-Communications |
3rd Author's Name | Toshinori WATANABE |
3rd Author's Affiliation | Graduate School of Information Systems, University of Electro-Communications |
4th Author's Name | Takanori YOKOYAMA |
4th Author's Affiliation | Graduate School of Information Systems, University of Electro-Communications |
Date | 2006-06-15 |
Paper # | DE2006-2,PRMU2006-40 |
Volume (vol) | vol.106 |
Number (no) | 97 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |