Presentation 2006-02-20
A study on accurate audiovisual indexing using audio signal
Naoki NITANDA, Miki HASEYAMA, Hideo KITAJIMA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) An accurate audiovisual indexing method is proposed in this paper. The proposed method classifies the audio signal into the following five audio classes: silence, speech, music, speech with music background, and speech with noise background. For this audio classification, both principal component analysis (PCA) and fuzzy c-means clustering (FCM) are utilized. The effective features can be theoretically extracted by using PCA, and the reliability of the clustering results can be measured by using FCM. Moreover, combination use of the proposed method and video segmentation technique, which can accurately detect the segmentation point, make it possible to measure the similarity between two adjacent shots. According to the similarity, the scene transition can be accurately detected. Experimental results obtained by the combination approach to actual audiovisual materials are shown to verify its effectiveness.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) audiovisual / indexing / principal component analysis / fuzzy c-means clustering
Paper # ITS2005-60,IE2005-267
Date of Issue

Conference Information
Committee ITS
Conference Date 2006/2/13(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Intelligent Transport Systems Technology (ITS)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A study on accurate audiovisual indexing using audio signal
Sub Title (in English)
Keyword(1) audiovisual
Keyword(2) indexing
Keyword(3) principal component analysis
Keyword(4) fuzzy c-means clustering
1st Author's Name Naoki NITANDA
1st Author's Affiliation Graduate School of Information Science and Technology, Hokkaido University()
2nd Author's Name Miki HASEYAMA
2nd Author's Affiliation Graduate School of Information Science and Technology, Hokkaido University
3rd Author's Name Hideo KITAJIMA
3rd Author's Affiliation Graduate School of Information Science and Technology, Hokkaido University
Date 2006-02-20
Paper # ITS2005-60,IE2005-267
Volume (vol) vol.105
Number (no) 608
Page pp.pp.-
#Pages 6
Date of Issue