講演名 2014-11-18
Learning from Positive and Unlabeled Data 2 : Computationally Efficient Estimation of Class Priors
,
PDFダウンロードページ PDFダウンロードページへ
抄録(和)
抄録(英) We consider the problem of estimating the class prior in an unlabeled dataset. Under the assumption that an additional labeled dataset is available, the class prior can be estimated by fitting a mixture of class-wise data distributions to the unlabeled data distribution. However, in practice, such an additional labeled dataset is often not available. In this paper, we show that, with additional samples coming only from the positive class, the class prior of the unlabeled dataset can be estimated correctly. Our key idea is to use properly penalized divergences for model fitting to cancel the error caused by the absence of negative samples. We further show that the use of the penalized L_1-distance gives a computationally efficient algorithm with an analytic solution, and establish its uniform deviation bound and estimation error bound. Finally, we experimentally demonstrate the usefulness of the proposed method.
キーワード(和)
キーワード(英) Learning from positive and unlabeled data / class-prior estimation / divergence matching
資料番号 IBISML2014-66
発行日

研究会情報
研究会 IBISML
開催期間 2014/11/10(から1日開催)
開催地(和)
開催地(英)
テーマ(和)
テーマ(英)
委員長氏名(和)
委員長氏名(英)
副委員長氏名(和)
副委員長氏名(英)
幹事氏名(和)
幹事氏名(英)
幹事補佐氏名(和)
幹事補佐氏名(英)

講演論文情報詳細
申込み研究会 Information-Based Induction Sciences and Machine Learning (IBISML)
本文の言語 ENG
タイトル(和)
サブタイトル(和)
タイトル(英) Learning from Positive and Unlabeled Data 2 : Computationally Efficient Estimation of Class Priors
サブタイトル(和)
キーワード(1)(和/英) / Learning from positive and unlabeled data
第 1 著者 氏名(和/英) / PLESSIS Marthinus Christoffel DU
第 1 著者 所属(和/英)
Department of Complexity Science and Engineering, University of Tokyo
発表年月日 2014-11-18
資料番号 IBISML2014-66
巻番号(vol) vol.114
号番号(no) 306
ページ範囲 pp.-
ページ数 7
発行日