Presentation 1999/7/23
Japanese Word Segmentation Using Textual Analysis for Full Text Search
Yasuki IIZUKA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper presents a word segmentation method based on a textual analysis. This method does not require any dictionary. The proposed method consists of two steps. The first step is building list of words by filtering string clusters devided by heuristic rules. These heuristic rules mainly utilyzes character types. The second step is segmenting texts based on the extracted word list and the other heuristic rules. The score of evaluation experiment is 90.2% precision and 85.5% recall.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) word extraction / word segmentation / corpus / heuristic rule
Paper # NLC99-14
Date of Issue

Conference Information
Committee NLC
Conference Date 1999/7/23(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Vice Chair

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Japanese Word Segmentation Using Textual Analysis for Full Text Search
Sub Title (in English)
Keyword(1) word extraction
Keyword(2) word segmentation
Keyword(3) corpus
Keyword(4) heuristic rule
1st Author's Name Yasuki IIZUKA
1st Author's Affiliation Multimedia Systems Research Laboratory Matsushita Electric Industrial Co., Ltd.()
Date 1999/7/23
Paper # NLC99-14
Volume (vol) vol.99
Number (no) 228
Page pp.pp.-
#Pages 8
Date of Issue