Presentation | 2002/7/9 Building a Large-Scale Japanese Grammar : Case Study Tomoya NORO, Kiyoaki SHIRAI, Takenobu TOKUNAGA, Hozumi TANAKA |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | A large-scale grammar is needed to parse a wide variety of sentences, but building one manually is difficult. A large-scale grammar can instead be derived from a large-scale tree-parsed corpus (hereinafter simply "corpus"). However, a grammar derived from a corpus produces a great number of parse trees, which lowers parsing accuracy and, worse, makes parsing slow and memory-intensive. For these reasons, the corpus and the grammar should be modified to reduce the number of parse trees. This paper proposes a method for building a large-scale context-free grammar that produces fewer parse trees by deferring the analysis of structures that cannot be resolved without semantic information, and shows the effectiveness of the grammar experimentally. We expect that this method makes it possible to build a practical large-scale grammar that produces fewer parse trees. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Large-Scale Context-Free Grammars / Tree Parsed Corpora / Syntactic Analysis |
Paper # | NLC2002-31 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2002/7/9 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Building a Large-Scale Japanese Grammar : Case Study |
Sub Title (in English) | |
Keyword(1) | Large-Scale Context-Free Grammars |
Keyword(2) | Tree Parsed Corpora |
Keyword(3) | Syntactic Analysis |
1st Author's Name | Tomoya NORO |
1st Author's Affiliation | Graduate School of Information Science and Engineering, Tokyo Institute of Technology |
2nd Author's Name | Kiyoaki SHIRAI |
2nd Author's Affiliation | Graduate School of Information Science, Japan Advanced Institute of Science and Technology |
3rd Author's Name | Takenobu TOKUNAGA |
3rd Author's Affiliation | Graduate School of Information Science and Engineering, Tokyo Institute of Technology |
4th Author's Name | Hozumi TANAKA |
4th Author's Affiliation | Graduate School of Information Science and Engineering, Tokyo Institute of Technology |
Date | 2002/7/9 |
Paper # | NLC2002-31 |
Volume (vol) | vol.102 |
Number (no) | 200 |
Page | pp.pp.- |
#Pages | 8 |
Date of Issue |