Presentation 2012-11-08
MCMC Strategy for Protein Complex Prediction Using Cluster Size Frequency
Daisuke TATSUKE, Osamu MARUYAMA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper we propose a Markov chain Monte Carlo sampling method for predicting protein complexes from protein-protein interactions (PPIs). Many of the existing tools for this problem are designed more or less based on a density measure of a subgraph of the PPI network. This kind of measures is less effective for smaller complexes. On the other hand, it can be found that the frequency of complexes of size, i, in a database of protein complexes often follows a power-law, i^<-γ>, where γ is a constant. Thus, most of the complexes are small-sized. For example, in CYC2008, a database of curated protein complexes of yeast, 42% of the complexes are heterodimeric, i.e., a complex consisting of two different proteins. In this work, we propose a protein complex prediction algorithm, called PPSampler (Proteins' Partition Sampler), which is designed based on the Metropolis-Hastings algorithm using a parameter representing a target value of the relative frequency of the number of predicted protein complexes of a particular size. In a performance comparison, PPSampler outperforms other existing algorithms. Furthermore, about half of the predicted clusters that are not matched with any known complexes in CYC2008 are statistically significant by Gene Ontology terms. Some of them can be expected to be true complexes.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) protein complex / protein-protein interaction / sampling / Markov chain Monte Carlo / Metropolis-Hastings / power-law
Paper # IBISML2012-91
Date of Issue

Conference Information
Committee IBISML
Conference Date 2012/10/31(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Information-Based Induction Sciences and Machine Learning (IBISML)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) MCMC Strategy for Protein Complex Prediction Using Cluster Size Frequency
Sub Title (in English)
Keyword(1) protein complex
Keyword(2) protein-protein interaction
Keyword(3) sampling
Keyword(4) Markov chain Monte Carlo
Keyword(5) Metropolis-Hastings
Keyword(6) power-law
1st Author's Name Daisuke TATSUKE
1st Author's Affiliation Graduate School of Mathematics, Kyushu University()
2nd Author's Name Osamu MARUYAMA
2nd Author's Affiliation Institute of Mathematics for Industry, Kyushu University
Date 2012-11-08
Paper # IBISML2012-91
Volume (vol) vol.112
Number (no) 279
Page pp.pp.-
#Pages 8
Date of Issue