連続マッチング回路を用いた子音の特徴抽出による音声認識方法(認識アルゴリズム,第12回音声言語シンポジウム:情報アクセス,音声・言語処理一般)

野中 淳; 岡本 佳太; 田向 権; 関根 優年

講演名	2010-12-21 連続マッチング回路を用いた子音の特徴抽出による音声認識方法(認識アルゴリズム,第12回音声言語シンポジウム:情報アクセス,音声・言語処理一般) 野中淳, 岡本佳太, 田向権, 関根優年,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	人間の聴覚は二つの耳から流入した音に対し,頑健性のある前処理を施すことで,音の特徴量や発生位置などを即座に抽出する.このとき,聴覚では周波数弁別が行われている.蝸牛処理回路による日本語母音判別は報告済みである.本報告では,聴覚前処理での子音判別方法として連続マッチング回路を提案する.本方式は実時間処理かつ子音テンプレート長に非依存であり,連続認識へも適用可能である.FPGAに本方式の回路を実装し,多重解像度化したテンプレートマッチングがほぼ実時間で動作した.また,連続マッチングにより得られた結果と周波数情報による分類とで子音認識を行ったところ、子音の認識率は高確率であった.提案手法の有用性を確認することが出来た.
抄録(英)	The human audition extracts voice characteristics from the sounds flowing into two ears. The recognition circuit of the vowel sound in Japanese has been reported by using a cochlea modeled. In this report, we propose a consonant recognition method inspired by the preprocessing circuits for human audition. In the proposed method, we also propose a pipelined consecutive matching circuits for the consonant recognition. The proposed circuit worked in success as a real-time processing circuit for the voice recognition, because it is independent from the length of consonant templates. In order to realize the proposed system, we implemented the circuit with FPGA board. As expected, the circuit achieved real-time template matching with the multiresolution analysis. In addition, the result shows that the circuit can classify the consonant almost completely.
キーワード(和)	子音認識 / テンプレート・マッチング / 多重解像度解析 / FPGA / 音声認識
キーワード(英)	Consonant Recognition / Template Matching / Multiresolution / FPGA / Speech Recognition
資料番号	NLC2010-22,SP2010-95
発行日

研究会情報
研究会	SP
開催期間	2010/12/13(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Speech (SP)
本文の言語	JPN
タイトル（和）	連続マッチング回路を用いた子音の特徴抽出による音声認識方法(認識アルゴリズム,第12回音声言語シンポジウム:情報アクセス,音声・言語処理一般)
サブタイトル（和）
タイトル（英）	Voice recognition method based on Characterization of Consonant using pipelined matching circuit
サブタイトル（和）
キーワード(1)（和/英）	子音認識 / Consonant Recognition
キーワード(2)（和/英）	テンプレート・マッチング / Template Matching
キーワード(3)（和/英）	多重解像度解析 / Multiresolution
キーワード(4)（和/英）	FPGA / FPGA
キーワード(5)（和/英）	音声認識 / Speech Recognition
第 1 著者氏名（和/英）	野中淳 / Jun NONAKA
第 1 著者所属（和/英）	東京農工大学工学府 Faculty of Engineering, Tokyo University of Agriculture and Technology
第 2 著者氏名（和/英）	岡本佳太 / Keita OKAMOTO
第 2 著者所属（和/英）	東京農工大学工学府 Faculty of Engineering, Tokyo University of Agriculture and Technology
第 3 著者氏名（和/英）	田向権 / Hakaru TAMUKOH
第 3 著者所属（和/英）	東京農工大学工学府 Faculty of Engineering, Tokyo University of Agriculture and Technology
第 4 著者氏名（和/英）	関根優年 / Masatoshi SEKINE
第 4 著者所属（和/英）	東京農工大学工学府 Faculty of Engineering, Tokyo University of Agriculture and Technology
発表年月日	2010-12-21
資料番号	NLC2010-22,SP2010-95
巻番号（vol）	vol.110
号番号（no）	357
ページ範囲	pp.-
ページ数	6
発行日