Presentation | 2020-12-17 A Novel Data Augmentation Framework Based on SeqGAN for Sentiment Analysis Jiawei Luo, Mondher Bouazizi, Tomoaki Ohtsuki, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Sentiment analysis is an important field in Natural Language Processing (NLP). It can analyze people's sentiment through their articles. On a related topic, machine learning has achieved high accuracy in sentiment analysis. However, it requires a large amount of high-quality training data that are hard to be collected. In this work, a novel data augmentation framework based on sequence generative adversarial networks (SeqGAN) is proposed to improve the sentiment analysis accuracy. In our framework, we conduct sentence compression and use a sentiment dictionary to retain the sentiment words for compressed data. The compressed data are used to train SeqGAN. We use the trained SeqGAN to generate artificial data for sentiment analysis. A classifier is used to discard generated data that may contain incorrect sentiment information. The results show that the proposed data augmentation framework helps SeqGAN generate high quality and novel text data. The data generated by the proposed framework improve the accuracy of the sentiment analysis classifier on some of the benchmark sentiment analysis dataset available. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | data augmentationsentiment analysismachine learningsentence compressionSeqGAN |
Paper # | PRMU2020-43 |
Date of Issue | 2020-12-10 (PRMU) |
Conference Information | |
Committee | PRMU |
---|---|
Conference Date | 2020/12/17(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Online |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Transfer learning and few shot learning |
Chair | Yoichi Sato(Univ. of Tokyo) |
Vice Chair | Akisato Kimura(NTT) / Masakazu Iwamura(Osaka Pref. Univ.) |
Secretary | Akisato Kimura(Mobility Technologies) / Masakazu Iwamura(Chubu Univ.) |
Assistant | Takashi Shibata(NTT) / Masashi Nishiyama(Tottori Univ.) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A Novel Data Augmentation Framework Based on SeqGAN for Sentiment Analysis |
Sub Title (in English) | |
Keyword(1) | data augmentationsentiment analysismachine learningsentence compressionSeqGAN |
1st Author's Name | Jiawei Luo |
1st Author's Affiliation | Keio University(Keio Univ.) |
2nd Author's Name | Mondher Bouazizi |
2nd Author's Affiliation | Keio University(Keio Univ.) |
3rd Author's Name | Tomoaki Ohtsuki |
3rd Author's Affiliation | Keio University(Keio Univ.) |
Date | 2020-12-17 |
Paper # | PRMU2020-43 |
Volume (vol) | vol.120 |
Number (no) | PRMU-300 |
Page | pp.pp.30-35(PRMU), |
#Pages | 6 |
Date of Issue | 2020-12-10 (PRMU) |