Presentation 1996/7/18
Development of RWC Text Database Tagged with Classification Code
Jun Toyoura, Takenobu Tokunaga, Hitoshi Isahara, Ryuichi Oka
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The Real World Computing Database Working Group has built a text database of about 30,000 newspaper articles tagged with UDC (Universal Decimal Classification) codes. The database is publicly available and free for research use. No other text database of this kind exists for Japanese. It can serve as a common benchmark for evaluating natural language processing systems, for example in text categorization and information extraction.
Keyword(in Japanese) (See Japanese page)
Keyword(in English)
Paper # NLC96-13
Date of Issue

Conference Information
Committee NLC
Conference Date 1996/7/18 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Development of RWC Text Database Tagged with Classification Code
Sub Title (in English)
Keyword(1)
1st Author's Name Jun Toyoura
1st Author's Affiliation Tsukuba Research Center RWCP
2nd Author's Name Takenobu Tokunaga
2nd Author's Affiliation Tokyo Institute of Technology
3rd Author's Name Hitoshi Isahara
3rd Author's Affiliation Communications Research Laboratory
4th Author's Name Ryuichi Oka
4th Author's Affiliation Tsukuba Research Center RWCP
Date 1996/7/18
Paper # NLC96-13
Volume (vol) vol.96
Number (no) 157
Page pp.-
#Pages 6
Date of Issue