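A Method for Detecting Partially Similar Word Sequences Using Lightweight Text Processing ("Automation: Inference, Discovery, Learning, Data Mining" and General)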

Presentation	2007-05-31 Similarity Sequences extracting method using efficient text processing Takaharu Takeda, Atsuhiro Takasu
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Similar expressions and character strings appear frequently across documents written about the same topic. When the strings match exactly, it is easy to identify where they appear by using a search index, but they are missed when a string differs partially or uses a different expression. In approximate pattern matching, the query is usually given by the user and the system only finds the most suitable document. In this study, we propose a method that maps, in a self-organizing manner, which parts of the text resemble which other parts.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Suffix arrays / weighted edit distance / text mining / Similarity Sequences extracting
Paper #	AI2007-7
Date of Issue

Paper Information
Registration To	Artificial Intelligence and Knowledge-Based Processing (AI)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Similarity Sequences extracting method using efficient text processing
Sub Title (in English)
Keyword(1)	Suffix arrays
Keyword(2)	weighted edit distance
Keyword(3)	text mining
Keyword(4)	Similarity Sequences extracting
1st Author's Name	Takaharu Takeda
1st Author's Affiliation	The Graduate University for Advanced Studies, School of Multidisciplinary Sciences, Department of Informatics
2nd Author's Name	Atsuhiro Takasu
2nd Author's Affiliation	National Institute of Informatics, Office for Promotion of Research Projects, Research Center for Testbeds and Prototyping
Date	2007-05-31
Paper #	AI2007-7
Volume (vol)	vol.107
Number (no)	78
Page	pp.-
#Pages	6
Date of Issue