Presentation 2002/7/9
Building a Large-Scale Japanese Grammar : Case Study
Tomoya NORO, Kiyoaki SHIRAI, Takenobu TOKUNAGA, Hozumi TANAKA
Abstract(in Japanese) (See Japanese page)
Abstract(in English) A large-scale grammar is needed to parse a wide variety of sentences, but building one manually is difficult. A large-scale grammar can instead be derived from a large-scale tree parsed corpus (hereinafter abbreviated to "corpus"). However, a grammar derived from a corpus produces a great number of parse trees, which lowers parsing accuracy and, worse still, makes parsing slow and memory-intensive. The corpus and the grammar should therefore be modified to decrease the number of parse trees. This paper proposes a method for building a large-scale context-free grammar that produces fewer parse trees by deferring the analysis of structures that cannot be resolved without semantic information, and shows the effectiveness of the grammar experimentally. We expect that our method makes it possible to build a practical large-scale grammar that produces fewer parse trees.
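The idea of deriving a grammar from a tree parsed corpus, as mentioned in the abstract, can be illustrated with a minimal sketch. It assumes a toy bracketed tree format such as "(S (NP (N taro)) (VP (V runs)))"; the function names (parse_tree, extract_rules, derive_grammar) and the sample sentences are hypothetical and are not taken from the paper. The sketch only reads context-free rules off the trees and counts them; it does not implement the corpus and grammar modifications the authors propose.

```python
from collections import Counter


def parse_tree(text):
    """Parse a bracketed tree into nested (label, children) tuples; leaves are strings."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read():
        nonlocal pos
        if tokens[pos] == "(":
            pos += 1                      # skip "("
            label = tokens[pos]
            pos += 1
            children = []
            while tokens[pos] != ")":
                children.append(read())
            pos += 1                      # skip ")"
            return (label, children)
        word = tokens[pos]                # a leaf (word)
        pos += 1
        return word

    return read()


def extract_rules(tree, rules):
    """Record one rule (lhs, rhs) per internal node, then recurse into the children."""
    if isinstance(tree, str):
        return
    label, children = tree
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    rules[(label, rhs)] += 1
    for child in children:
        extract_rules(child, rules)


def derive_grammar(corpus):
    """Return a Counter mapping each rule (lhs, rhs) to its frequency in the corpus."""
    rules = Counter()
    for line in corpus:
        extract_rules(parse_tree(line), rules)
    return rules


if __name__ == "__main__":
    toy_corpus = [
        "(S (NP (N taro)) (VP (V runs)))",
        "(S (NP (N hanako)) (VP (V sleeps)))",
    ]
    for (lhs, rhs), count in derive_grammar(toy_corpus).items():
        print(f"{lhs} -> {' '.join(rhs)}  [{count}]")
```

A grammar read off a real corpus in this way tends to contain many rules, which is one source of the large number of parse trees the abstract describes.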
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Large-Scale context free Grammars / Tree Parsed Corpora / Syntactic Analysis
Paper # NLC2002-31
Date of Issue

Conference Information
Committee NLC
Conference Date 2002/7/9 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Building a Large-Scale Japanese Grammar : Case Study
Sub Title (in English)
Keyword(1) Large-Scale context free Grammars
Keyword(2) Tree Parsed Corpora
Keyword(3) Syntactic Analysis
1st Author's Name Tomoya NORO
1st Author's Affiliation Graduate School of Information Science and Engineering, Tokyo Institute of Technology
2nd Author's Name Kiyoaki SHIRAI
2nd Author's Affiliation Graduate School of Information Science, Japan Advanced Institute of Science and Technology
3rd Author's Name Takenobu TOKUNAGA
3rd Author's Affiliation Graduate School of Information Science and Engineering, Tokyo Institute of Technology
4th Author's Name Hozumi TANAKA
4th Author's Affiliation Graduate School of Information Science and Engineering, Tokyo Institute of Technology
Date 2002/7/9
Paper # NLC2002-31
Volume (vol) vol.102
Number (no) 200
Page pp.-
#Pages 8
Date of Issue