背景雑音を対象とした特徴パラメータ正準化法(ポスターセッション)(第6回音声言語シンポジウム)

福田 隆; 新田 恒雄

講演名	2004/12/14 背景雑音を対象とした特徴パラメータ正準化法(ポスターセッション)(第6回音声言語シンポジウム) 福田隆, 新田恒雄,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	確率的分類器(HMM)に基づく認識システムは,性別,話速,音響環境等に起因するHMMの隠れ変数のバイアスにより性能が低下する.隠れ変数の影響を特徴抽出の段階で吸収することができれば,より頑健な認識システムを構成できると考えられる.本報告では,雑音強度と男女の声質を対象とした特徴パラメータ正準化方式について述べる.正準化は音響特徴空間と音響モデルの間に,中間表現である音素弁別特徴(DPF)空間を導入することにより実現する.正準化器は,音響的変動要因に対応させた複数のDPF抽出器と一つのDPF選択器から成る.DPF抽出法として,(A)雑音強度および声質の双方を表現するDPF抽出器から直接抽出する方法,および(B)声質に応じて設計したDPF抽出器から二つのDPFを抽出した後,雑音を抑圧する方式の二通りを比較する.DPF選択器は8種類のDPFベクトルのうち,音響モデルに最も適合するDPFベクトルを正準化DPFベクトルとして抽出する.評価実験では,単一のHMM分類器に正準化DPFベクトルを入力する方式と,隠れ変数に対応させた複数のHMM分類器に従来の音響特徴パラメータ(MFCC)を入力する方式とを比較する.提案方式は,少ない演算量にもかかわらず良好な性能が得られることを示す.
抄録(英)	Acoustic models (AMs) of an HMM-based classifier include various types of hidden variables such as gender type, speaking rate, and acoustic environment. If there exists a canonicalization process that reduces the influence of the hidden variables from AMs, a robust automatic speech recognition (ASR) system can be realized, hi this paper, we describe the configuration of a canonicalization process targeting gender type and noise intensity as hidden variables. The proposed canonicalization process is composed of multiple distinctive phonetic feature (DPF) extractors corresponding to the hidden variables and a DPF selector which selects a desirable DPF from multiple DPFs. hi a DPF extraction stage, two approaches, namely (A) extracting DPFs directly from each DPF extractor which represents both gender type and noise intensity, and (B) dividing the DPF extraction stage into a DPF extraction part of gender type and a noise suppression part of different S/N-ratio type, are investigated. Experiments are carried out by comparing the combination of the canonicalized DPF and a single HMM classifier, and also the combination of a single acoustic feature (MFCC) and multiple HMM classifiers. The result shows that the proposed canonicalization method outperformed both of the conventional ASR with MFCC and a single HMM, and the ASR with multiple HMMs
キーワード(和)	音声認識 / 特徴抽出 / 正準化 / 音素弁別特徴 / 雑音抑圧器
キーワード(英)	automatic speech recognition / feature extraction / canonicalization / distinctive phonetic feature / noise suppressor
資料番号	NLC2004-78,SP2004-118
発行日

研究会情報
研究会	SP
開催期間	2004/12/14(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Speech (SP)
本文の言語	JPN
タイトル（和）	背景雑音を対象とした特徴パラメータ正準化法(ポスターセッション)(第6回音声言語シンポジウム)
サブタイトル（和）
タイトル（英）	Canonicalization of Feature Parameters targeting Background Noise
サブタイトル（和）
キーワード(1)（和/英）	音声認識 / automatic speech recognition
キーワード(2)（和/英）	特徴抽出 / feature extraction
キーワード(3)（和/英）	正準化 / canonicalization
キーワード(4)（和/英）	音素弁別特徴 / distinctive phonetic feature
キーワード(5)（和/英）	雑音抑圧器 / noise suppressor
第 1 著者氏名（和/英）	福田隆 / Takashi FUKUDA
第 1 著者所属（和/英）	豊橋技術科学大学大学院工学研究科 Graduate School of Engineering, Toyohashi University of Technology
第 2 著者氏名（和/英）	新田恒雄 / Tsuneo NITTA
第 2 著者所属（和/英）	豊橋技術科学大学大学院工学研究科 Graduate School of Engineering, Toyohashi University of Technology
発表年月日	2004/12/14
資料番号	NLC2004-78,SP2004-118
巻番号（vol）	vol.104
号番号（no）	542
ページ範囲	pp.-
ページ数	6
発行日