Presentation 2019-12-04
Attempting News Classification by Media Characteristics
Yoshifumi Seki,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The purpose of this study is to classify news articles by medium characteristics. In this paper, we compare the vectorization method of documents through the construction of a classification model for major newspapers and Tabloid. Although the classification of news articles is an popular issues, most of them deal with topics and categories, and there are few attempts to reflect differences in writing and nuances within the same category. However, it is important to evaluate such writing and nuances, which is a very important task in the modern news consumption environment. As a result of our experiment, many models learned events and topics strongly, and it was found that classification was very difficult. On the other hand, it was suggested that the model using BERT may be able to consider writing and nuances.
Keyword(in Japanese) (See Japanese page)
Keyword(in English)
Paper # NLC2019-29
Date of Issue 2019-11-27 (NLC)

Conference Information
Committee NLC / IPSJ-NL / SP / IPSJ-SLP
Conference Date 2019/12/4(3days)
Place (in Japanese) (See Japanese page)
Place (in English) NHK Science & Technology Research Labs.
Topics (in Japanese) (See Japanese page)
Topics (in English) The 6th Natural Language Processing Symposium & The 21th Spoken Language Symposium
Chair Takeshi Sakaki(Hottolink) / / Hisashi Kawai(NICT)
Vice Chair Mitsuo Yoshida(Toyohashi Univ. of Tech.) / Kazutaka Shimada(Kyushu Inst. of Tech.) / / Akinobu Ri(Nagoya Inst. of Tech.)
Secretary Mitsuo Yoshida(Ryukoku Univ.) / Kazutaka Shimada(NTT) / / Akinobu Ri(Kyoto Univ.) / (Waseda Univ.)
Assistant Takeshi Kobayakawa(NHK) / Hiroki Sakaji(Univ. of Tokyo) / / Tomoki Koriyama(Univ. of Tokyo) / Yusuke Ijima(NTT)

Paper Information
Registration To Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Attempting News Classification by Media Characteristics
Sub Title (in English)
Keyword(1)
Keyword(2)
Keyword(3)
1st Author's Name Yoshifumi Seki
1st Author's Affiliation Gunosy Inc.(Gunosy Inc.)
Date 2019-12-04
Paper # NLC2019-29
Volume (vol) vol.119
Number (no) NLC-320
Page pp.pp.1-5(NLC),
#Pages 5
Date of Issue 2019-11-27 (NLC)