Presentation 2007-05-31
Similarity Sequences extracting method using efficient text processing
Takaharu Takeda, Atsuhiro Takasu,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Similar expression and character string appear frequently during documents written about the same topic. It is easy to identify where they appeared by indexing for searches, if those are the perfect matching string, but they are taken aside when a string is different partially or includes different expression. Usually query is given by user in approximate pattern matching, the system only finds most suitable document, however we would propose the mapping method that which part and which part resemble self-organizing in this study.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Suffix arrays / weighted edit distance / text mining / Similarity Sequences extracting
Paper # AI2007-7
Date of Issue

Conference Information
Committee AI
Conference Date 2007/5/24(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Vice Chair

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Similarity Sequences extracting method using efficient text processing
Sub Title (in English)
Keyword(1) Suffix arrays
Keyword(2) weighted edit distance
Keyword(3) text mining
Keyword(4) Similarity Sequences extracting
1st Author's Name Takaharu Takeda
1st Author's Affiliation The Graduate university for Advanced Studies, the School of Multidisciplinary Science, the Department of Informatics()
2nd Author's Name Atsuhiro Takasu
2nd Author's Affiliation National Institute of Informatics, Office for Promotion of Research Projects, Research Center for Testbeds and Prototyping
Date 2007-05-31
Paper # AI2007-7
Volume (vol) vol.107
Number (no) 78
Page pp.pp.-
#Pages 6
Date of Issue