Presentation 2016-07-06
A Semi-supervised Learning Method for Imbalanced Binary Classification
Akinori Fujino, Naonori Ueda,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper presents a semi-supervised learning method for imbalanced binary classification where the number of positive samples differs largely from that of negative samples. The area under the ROC curve (AUC) is often used as an effective measure for evaluating binary classifiers in such imbalanced tasks, and thus AUC-optimized classifiers have been developed which were trained to maximize an AUC value measured on a labeled sample set. The proposed method utilizes generative models for assisting the incorporation of unlabeled samples in AUC-optimized classifiers. We applied the proposed method to text classification by employing a naive Bayes model as the generative model. Using two benchmark datasets, we confirmed experimentally that the proposed method was more useful for imbalanced binary classification than conventional semi-supervised learning methods based on discriminative, generative, and those hybrid models. We also confirmed the effect of using generative models for semi-supervised learning of AUC-optimized classifiers.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Semi-supervised Learning / AUC Maximization / Generative Model / Naive Bayes Model / Text Classification
Paper # IBISML2016-3
Date of Issue 2016-06-28 (IBISML)

Conference Information
Committee NC / IPSJ-BIO / IBISML / IPSJ-MPS
Conference Date 2016/7/4(3days)
Place (in Japanese) (See Japanese page)
Place (in English) Okinawa Institute of Science and Technology
Topics (in Japanese) (See Japanese page)
Topics (in English) Machine Learning Approach to Biodata Mining, and General
Chair Shigeo Sato(Tohoku Univ.) / / Kenji Fukumizu(ISM)
Vice Chair Masafumi Hagiwara(Keio Univ.) / / Masashi Sugiyama(Univ. of Tokyo) / Hisashi Kashima(Kyoto Univ.)
Secretary Masafumi Hagiwara(Kyoto Sangyo Univ.) / (Tokyo Inst. of Tech.) / Masashi Sugiyama / Hisashi Kashima(Univ. of Tokyo) / (Nagoya Inst. of Tech.)
Assistant Hisanao Akima(Tohoku Univ.) / Yoshihisa Shinozawa(Keio Univ.) / / Toshihiro Kamishima(AIST) / Tomoharu Iwata(NTT)

Paper Information
Registration To Technical Committee on Neurocomputing / Special Interest Group on Bioinformatics and Genomics / Technical Committee on Infomation-Based Induction Sciences and Machine Learning / Special Interest Group on Mathematical Modeling and Problem Solving
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A Semi-supervised Learning Method for Imbalanced Binary Classification
Sub Title (in English)
Keyword(1) Semi-supervised Learning
Keyword(2) AUC Maximization
Keyword(3) Generative Model
Keyword(4) Naive Bayes Model
Keyword(5) Text Classification
1st Author's Name Akinori Fujino
1st Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
2nd Author's Name Naonori Ueda
2nd Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
Date 2016-07-06
Paper # IBISML2016-3
Volume (vol) vol.116
Number (no) IBISML-121
Page pp.pp.195-200(IBISML),
#Pages 6
Date of Issue 2016-06-28 (IBISML)