Presentation | 1996/7/19 High Speed Morphological Analysis using DFA Shinsuke Mori, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Morphological analysis, which segments the input sentence into words and attaches parts of speech to them, is the most fundamental process of Japanese language processing. This process contains dictionary look-up of all substrings of input sentence. In this paper, we propose a method to convert the dictionary into a deterministic finite automaton and realize high-speed dictionary look-up. An advantage of our method is that it enables faster dictionary look-up and a disadvantage is that required memory space is larger than AC method-based dictionary look-up. The experimental results tells that our method requires 16.1 times as large memory space as AC method and is 11.7 times as fast as AC method in dictionary look-up. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Morphological analysis / Dictionary lookup / Speedup / DFA / AC method |
Paper # | NLC96-23 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 1996/7/19(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | High Speed Morphological Analysis using DFA |
Sub Title (in English) | |
Keyword(1) | Morphological analysis |
Keyword(2) | Dictionary lookup |
Keyword(3) | Speedup |
Keyword(4) | DFA |
Keyword(5) | AC method |
1st Author's Name | Shinsuke Mori |
1st Author's Affiliation | Department of Electrical Engineering, Kyoto University() |
Date | 1996/7/19 |
Paper # | NLC96-23 |
Volume (vol) | vol.96 |
Number (no) | 158 |
Page | pp.pp.- |
#Pages | 7 |
Date of Issue |