Presentation 2002/5/10
Common Sequence Analysis of Web Logs
Junji UNEDA, Haruo YOKOTA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The usability of a web site has to be dramatically changed by a configuration of the web site. We can expect a desirable web site configuration with the knowledge of user access patterns by analyzing web access logs of the web site. The knowledge of high frequent common access sequence patterns will also be useful for the placement strategy of advertisement, and so on. In this paper, we propose to derive frequent access sequence patterns by applying the Longest Common Subsequence (LCS) algorithm to access routes, transition of URLs accessed by a user in a session with in the web logs. We also propose a preprocess of the web logs using the web configuration with session ID to apply the proposed method. We then show the effect on the application of the LCS by an experimentation using access logs of an actual web site.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Web / Data mining / access log analysis / Longest Common Subsequence / extract access seuquence
Paper # DE2002-2
Date of Issue

Conference Information
Committee DE
Conference Date 2002/5/10(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Common Sequence Analysis of Web Logs
Sub Title (in English)
Keyword(1) Web
Keyword(2) Data mining
Keyword(3) access log analysis
Keyword(4) Longest Common Subsequence
Keyword(5) extract access seuquence
1st Author's Name Junji UNEDA
1st Author's Affiliation Dept. of Electrical and Electronic Engineering, Tokyo Institute of Technology()
2nd Author's Name Haruo YOKOTA
2nd Author's Affiliation Global Scientific Information & Computing Center, Tokyo Institute of Technology
Date 2002/5/10
Paper # DE2002-2
Volume (vol) vol.102
Number (no) 64
Page pp.pp.-
#Pages 6
Date of Issue