Presentation 2010-11-19
A Note on Documents Classification using PLSI
Gendo KUMOI, Takashi ISHIDA, Masayuki GOTO, Shigeichi HIRASAWA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Recently, with increasing electric documents, effective knowledge acquisition become important issue method. Document classification is one of the techniques. In this paper, we use Probabilistic Latent Semantic Indexing (PLSI) as document classification technique. The characteristics of PLSI is dealtwith handle multi-category and lead solution with using EM algorithm. We propose a method that can classify a small amount of training data using the EM algorithm. We apply this method into WEB page and show its effectiveness.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) PLSI / Document classification / EM algorithm
Paper # AI2010-33
Date of Issue

Conference Information
Committee AI
Conference Date 2010/11/12(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A Note on Documents Classification using PLSI
Sub Title (in English)
Keyword(1) PLSI
Keyword(2) Document classification
Keyword(3) EM algorithm
1st Author's Name Gendo KUMOI
1st Author's Affiliation Waseda Research Institute for Science and Engineering()
2nd Author's Name Takashi ISHIDA
2nd Author's Affiliation Media Network Center, Waseda University
3rd Author's Name Masayuki GOTO
3rd Author's Affiliation Faculty of Science and Engineering, Waseda University
4th Author's Name Shigeichi HIRASAWA
4th Author's Affiliation Faculty of Information Technology and Business, Cyber University:Waseda Research Institute for Science and Engineering
Date 2010-11-19
Paper # AI2010-33
Volume (vol) vol.110
Number (no) 301
Page pp.pp.-
#Pages 6
Date of Issue