Presentation | 2001/10/12 Motif Search Algorithm and Its Applications for Gene Finding Tetsuo Shibuya, Isidore Rigoutsos, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Gene identification is one of the most important problems in molecular biology and has been receiving increasing attention with the advent of large scale sequencing projects. Previous strategies for solving this problem can be categorized into essentially two schools of thought: statistical approaches such as hidden Markov models (HMMs), and methods based on database similarity searches. In this paper, we propose a new approach for tackling the gene identification problem. The approach employs the Bio-Dictionary [27, 29], a database of patterns that cover essentially all of the currently available sample of natural protein sequence space, to determine gene candidates among the ORFs that can be identified in a given DNA strand; as a matter of fact, the method combines the best characteristics from each of the above-mentioned schools of thought. We additionally associate the patterns in the Bio-Dictionary with appropriately computed weights and this leads to further improvements in our gene identification ability. Furthermore, we propose a fast search algorithm for searching motifs of Bio-Dictionary. We demonstrate the method's improved capabilities through an analysis and discussion of the results we obtain by processing 17 whole archaeal and bacterial genomes. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | gene identification / Bio-Dictionary / motif library / algorithm / database search / coding quality |
Paper # | COMP 2001-48 |
Date of Issue |
Conference Information | |
Committee | COMP |
---|---|
Conference Date | 2001/10/12(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Theoretical Foundations of Computing (COMP) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Motif Search Algorithm and Its Applications for Gene Finding |
Sub Title (in English) | |
Keyword(1) | gene identification |
Keyword(2) | Bio-Dictionary |
Keyword(3) | motif library |
Keyword(4) | algorithm |
Keyword(5) | database search |
Keyword(6) | coding quality |
1st Author's Name | Tetsuo Shibuya |
1st Author's Affiliation | IBM Tokyo Research Laboratory() |
2nd Author's Name | Isidore Rigoutsos |
2nd Author's Affiliation | Bioinformatics & Pattern Discovery Group, Computational Biology Center, IBM Thomas J. Watson Research Center |
Date | 2001/10/12 |
Paper # | COMP 2001-48 |
Volume (vol) | vol.101 |
Number (no) | 376 |
Page | pp.pp.- |
#Pages | 8 |
Date of Issue |