Presentation 2001/10/12
Motif Search Algorithm and Its Applications for Gene Finding
Tetsuo Shibuya, Isidore Rigoutsos,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Gene identification is one of the most important problems in molecular biology and has been receiving increasing attention with the advent of large scale sequencing projects. Previous strategies for solving this problem can be categorized into essentially two schools of thought: statistical approaches such as hidden Markov models (HMMs), and methods based on database similarity searches. In this paper, we propose a new approach for tackling the gene identification problem. The approach employs the Bio-Dictionary [27, 29], a database of patterns that cover essentially all of the currently available sample of natural protein sequence space, to determine gene candidates among the ORFs that can be identified in a given DNA strand; as a matter of fact, the method combines the best characteristics from each of the above-mentioned schools of thought. We additionally associate the patterns in the Bio-Dictionary with appropriately computed weights and this leads to further improvements in our gene identification ability. Furthermore, we propose a fast search algorithm for searching motifs of Bio-Dictionary. We demonstrate the method's improved capabilities through an analysis and discussion of the results we obtain by processing 17 whole archaeal and bacterial genomes.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) gene identification / Bio-Dictionary / motif library / algorithm / database search / coding quality
Paper # COMP 2001-48
Date of Issue

Conference Information
Committee COMP
Conference Date 2001/10/12(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Theoretical Foundations of Computing (COMP)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Motif Search Algorithm and Its Applications for Gene Finding
Sub Title (in English)
Keyword(1) gene identification
Keyword(2) Bio-Dictionary
Keyword(3) motif library
Keyword(4) algorithm
Keyword(5) database search
Keyword(6) coding quality
1st Author's Name Tetsuo Shibuya
1st Author's Affiliation IBM Tokyo Research Laboratory()
2nd Author's Name Isidore Rigoutsos
2nd Author's Affiliation Bioinformatics & Pattern Discovery Group, Computational Biology Center, IBM Thomas J. Watson Research Center
Date 2001/10/12
Paper # COMP 2001-48
Volume (vol) vol.101
Number (no) 376
Page pp.pp.-
#Pages 8
Date of Issue