Presentation 2018-06-13
Post Clustering Inference, with Application to Single Cell Analysis
Shigenori Inoue, Yuta Umezu, Shouma Tsubota, Ichiro Takeuchi
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Many datasets, such as customer data and gene expression data, consist of several subgroups. Clustering is one way to analyze such data: it divides the samples into clusters based on their similarity so that knowledge can be obtained from the resulting clusters. Examining the features of each cluster after clustering is therefore an important task for understanding the essential structure of the data. Various clustering methods have been studied so far, but none of them have focused on statistical guarantees for the features after clustering. In this study, we develop a selective inference framework for hypothesis testing of the features in each cluster after $K$-means clustering. We confirm the usefulness of the proposed method on synthetic data and in single cell data analysis.
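The motivation stated in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method and does not implement selective inference; it only shows, under simplifying assumptions (one-dimensional data, $K = 2$, a normal-approximation threshold of 1.96), why a naive test applied after clustering lacks a statistical guarantee. All function names and parameters (`two_means_1d`, `welch_t`, `naive_rejection_rate`, `n_sims`, `n`, `seed`) are hypothetical and chosen for illustration.

```python
import random
import statistics


def two_means_1d(xs, iters=50):
    """Lloyd's algorithm for K=2 on 1-D data; returns the two clusters."""
    c0, c1 = min(xs), max(xs)  # simple deterministic initialisation
    a, b = [], []
    for _ in range(iters):
        # Assignment step: each point goes to the nearer centre.
        a = [x for x in xs if abs(x - c0) <= abs(x - c1)]
        b = [x for x in xs if abs(x - c0) > abs(x - c1)]
        # Update step: centres become the cluster means.
        new_c0, new_c1 = statistics.fmean(a), statistics.fmean(b)
        if (new_c0, new_c1) == (c0, c1):
            break
        c0, c1 = new_c0, new_c1
    return a, b


def welch_t(a, b):
    """Welch's t statistic for the difference of two sample means."""
    va, vb = statistics.variance(a), statistics.variance(b)
    se = (va / len(a) + vb / len(b)) ** 0.5
    return (statistics.fmean(a) - statistics.fmean(b)) / se


def naive_rejection_rate(n_sims=200, n=50, seed=0):
    """Fraction of null datasets on which the naive test rejects."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_sims):
        # Null data: one Gaussian population, so no true subgroups exist.
        xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
        a, b = two_means_1d(xs)
        if len(a) < 2 or len(b) < 2:
            continue  # degenerate split: count as "not rejected"
        # Naive test: ignores that the clusters were chosen using xs.
        if abs(welch_t(a, b)) > 1.96:
            rejections += 1
    return rejections / n_sims


rate = naive_rejection_rate()
print(f"naive rejection rate under the null: {rate:.2f}")
```

Because the clusters are chosen to be well separated on the very data that is then tested, the naive rejection rate is far above the nominal 5% level even though there are no true subgroups. Correcting for this selection step is the kind of guarantee that a post-clustering inference framework aims to provide.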
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Hypothesis Testing / $K$-means Clustering / Post Selection Inference / Single Cell Data
Paper # IBISML2018-3
Date of Issue 2018-06-06 (IBISML)

Conference Information
Committee NC / IBISML / IPSJ-BIO / IPSJ-MPS
Conference Date 2018/6/13(3days)
Place (in Japanese) (See Japanese page)
Place (in English) Okinawa Institute of Science and Technology
Topics (in Japanese) (See Japanese page)
Topics (in English) Machine Learning Approach to Biodata Mining, and General
Chair Yutaka Hirata(Chubu Univ.) / Hisashi Kashima(Kyoto Univ.)
Vice Chair Hayaru Shouno(UEC) / Masashi Sugiyama(Univ. of Tokyo) / Koji Tsuda(Univ. of Tokyo)
Secretary Hayaru Shouno(Nagoya Univ.) / Masashi Sugiyama(NAIST) / Koji Tsuda(Nagoya Inst. of Tech.) / (AIST)
Assistant Keiichiro Inagaki(Chubu Univ.) / Takashi Shinozaki(NICT) / Tomoharu Iwata(NTT) / Shigeyuki Oba(Kyoto Univ.)

Paper Information
Registration To Technical Committee on Neurocomputing / Technical Committee on Information-Based Induction Sciences and Machine Learning / Special Interest Group on Bioinformatics and Genomics / Special Interest Group on Mathematical Modeling and Problem Solving
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Post Clustering Inference, with Application to Single Cell Analysis
Sub Title (in English)
Keyword(1) Hypothesis Testing
Keyword(2) $K$-means Clustering
Keyword(3) Post Selection Inference
Keyword(4) Single Cell Data
1st Author's Name Shigenori Inoue
1st Author's Affiliation Nagoya Institute of Technology(NIT)
2nd Author's Name Yuta Umezu
2nd Author's Affiliation Nagoya Institute of Technology(NIT)
3rd Author's Name Shouma Tsubota
3rd Author's Affiliation Nagoya University(Nagoya Univ.)
4th Author's Name Ichiro Takeuchi
4th Author's Affiliation Nagoya Institute of Technology/RIKEN/National Institute for Materials Science(NIT/RIKEN/NIMS)
Date 2018-06-13
Paper # IBISML2018-3
Volume (vol) vol.118
Number (no) IBISML-81
Page pp.15-22(IBISML)
#Pages 8
Date of Issue 2018-06-06 (IBISML)