Presentation | 2004/11/27 A Mixed Similarity Measure for Data with Numeric, Symbolic and Ordinal Attributes(Artificial Intelligence I)(Joint Workshop of Vietnamese Society of AI, SIGKBS-JSAI, ICS-IPSJ, and IEICE-SIGAI on Active Mining) Nguyen Ngoc Binh, Than Van Cuong, Nguyen Thanh Phuong, Ho Tu Bao, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Many methods of knowledge discovery in databases are distance-based, such as instance-based learning or clustering where similarity measures between objects plays an essential role. Besides, it is known that most our real-word data not only contain numeric, symbolic, and ordinal attributes individually but also carry all of them in mixed way. Therefore, a Mixed Similarity Measure (MSM) for numeric and symbolic attributes is not enough for various data process. Moreover, the high cost of O (n^2logn^2) and O (n^2) for time and complexities of the existing algorithms do not allow the MSM to be applied to large datasets in KDD. As a result, we have proposed a fast algorithm to compute the Goodall's MSM for numeric and symbolic attributes in a linear complexity. In this paper, as an extension of the MSM, we consider an MSM^* for numeric, symbolic and ordinal attributes and describe a fast algorithm for MSM^* with a linear complexity as well. The experimental results show that the proposed MSM^* is also better than MSM and C4.5/See5.0 for the classification problem. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | |
Paper # | AI2004-30 |
Date of Issue |
Conference Information | |
Committee | AI |
---|---|
Conference Date | 2004/11/27(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Artificial Intelligence and Knowledge-Based Processing (AI) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A Mixed Similarity Measure for Data with Numeric, Symbolic and Ordinal Attributes(Artificial Intelligence I)(Joint Workshop of Vietnamese Society of AI, SIGKBS-JSAI, ICS-IPSJ, and IEICE-SIGAI on Active Mining) |
Sub Title (in English) | |
Keyword(1) | |
1st Author's Name | Nguyen Ngoc Binh |
1st Author's Affiliation | Hanoi University of Technology, Vietnam() |
2nd Author's Name | Than Van Cuong |
2nd Author's Affiliation | Hanoi University of Technology, Vietnam |
3rd Author's Name | Nguyen Thanh Phuong |
3rd Author's Affiliation | Hanoi University of Technology, Vietnam |
4th Author's Name | Ho Tu Bao |
4th Author's Affiliation | Japan Advanced Institute of Science and Technology |
Date | 2004/11/27 |
Paper # | AI2004-30 |
Volume (vol) | vol.104 |
Number (no) | 485 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |