Presentation 2000/7/21
Search and compression of large amount of text files with two stage compression method
Shingo Otsuka, Nobuyoshi Miyazaki,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper, we discuss search and compression of large amount of text files with two stage compression method. The texts are usually stored in secondary storage, and they are frequently compressed for file size saving. When we search compressed text files, it is usually necessary to decode them before search. Therefore, search is time consuming. On the other hand, we can use indexing for fast search. But indices consume extra amount of secondary storage. We proposed a two-stage compression method to improve the performance. It compresses text files using index files and compresses the result again with another algorithm. This paper discusses application of the two-stage compression method for large amount of text files such as newspaper and magazine data, and proposes an improved method.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) text search / text compression / efficient search
Paper # DE2000-78
Date of Issue

Conference Information
Committee DE
Conference Date 2000/7/21(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Search and compression of large amount of text files with two stage compression method
Sub Title (in English)
Keyword(1) text search
Keyword(2) text compression
Keyword(3) efficient search
1st Author's Name Shingo Otsuka
1st Author's Affiliation Department of Computer Science, Chiba Institute of Technology()
2nd Author's Name Nobuyoshi Miyazaki
2nd Author's Affiliation Department of Computer Science, Chiba Institute of Technology
Date 2000/7/21
Paper # DE2000-78
Volume (vol) vol.100
Number (no) 228
Page pp.pp.-
#Pages 7
Date of Issue