Abstract / Keywords
Title
2016-03-28 13:15
[Poster presentation] An evaluation of acoustic-to-articulatory inversion mapping with latent trajectory Gaussian mixture model ○Patrick Lumban Tobing(NAIST)・Tomoki Toda(Nagoya Univ./NAIST)・Hirokazu Kameoka(NTT)・Satoshi Nakamura(NAIST) EA2015-85 SIP2015-134 SP2015-113
Abstract
In this report, we present an evaluation of acoustic-to-articulatory inversion mapping based on a latent trajectory Gaussian mixture model (LTGMM). In a conventional GMM-based inversion mapping system, the GMM parameters are optimized by maximizing the likelihood of the joint static and dynamic features of acoustic-articulatory data. In the mapping process, given the acoustic data, smoothly varying articulatory parameter trajectories are estimated by maximizing the conditional likelihood of their static features only, where the inter-frame correlation is taken into account by imposing the explicit relationship between static and dynamic features. Because the training and mapping criteria differ, the trained GMM is not optimal for the mapping process. A trajectory training method has been proposed to address this inconsistency [1]; however, it has difficulty optimizing some parameters, such as the covariance matrices and the mixture component sequence. In this report, as an alternative way to address the inconsistency, we propose an inversion mapping method based on a latent trajectory GMM, inspired by the latent trajectory hidden Markov model [2]. Unlike the conventional trajectory training method, the proposed method makes it possible to apply the EM algorithm to model parameter optimization. The experimental results demonstrate that the proposed LTGMM outperforms the conventional GMM on the acoustic-to-articulatory inversion mapping task, yielding a lower root-mean-square error and a higher correlation coefficient.
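The conventional GMM-based mapping summarized above can be illustrated with a minimal frame-wise sketch: a GMM is fit on joint acoustic-articulatory vectors, and articulatory features are estimated as the conditional (MMSE) expectation given an acoustic frame. This is a toy illustration on synthetic data, assuming scikit-learn's `GaussianMixture`; it omits the dynamic features and the trajectory (MLPG-style) estimation that the report describes, and it does not implement the proposed LTGMM.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Toy joint data: x = "acoustic" features (2-dim), y = "articulatory" (1-dim),
# with a simple linear dependence plus noise. Purely illustrative.
x = rng.normal(size=(500, 2))
y = 0.8 * x[:, :1] + rng.normal(scale=0.1, size=(500, 1))
z = np.hstack([x, y])

# Fit a GMM on the joint static feature space (no dynamic features here).
gmm = GaussianMixture(n_components=4, covariance_type="full",
                      random_state=0).fit(z)
dx = 2  # acoustic dimensionality

def map_frame(x_t):
    """Frame-wise MMSE estimate of articulatory features given one acoustic frame."""
    n_mix = len(gmm.weights_)
    log_post = np.empty(n_mix)
    cond_means = np.empty((n_mix, z.shape[1] - dx))
    for m in range(n_mix):
        mu_x, mu_y = gmm.means_[m, :dx], gmm.means_[m, dx:]
        cov = gmm.covariances_[m]
        Sxx, Syx = cov[:dx, :dx], cov[dx:, :dx]
        Sxx_inv = np.linalg.inv(Sxx)
        diff = x_t - mu_x
        # log p(m) + log N(x_t; mu_x, Sxx): mixture responsibility for this frame.
        log_post[m] = (np.log(gmm.weights_[m])
                       - 0.5 * (diff @ Sxx_inv @ diff
                                + np.linalg.slogdet(2 * np.pi * Sxx)[1]))
        # Conditional mean of y given x_t under component m.
        cond_means[m] = mu_y + Syx @ Sxx_inv @ diff
    post = np.exp(log_post - log_post.max())
    post /= post.sum()
    return post @ cond_means  # responsibility-weighted conditional mean

y_hat = map_frame(np.array([1.0, 0.0]))
```

The trajectory methods discussed in the report go beyond this per-frame estimate by jointly modeling static and dynamic features, so that the estimated articulatory trajectory varies smoothly across frames.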
Keywords
acoustic-to-articulatory inversion mapping / Gaussian mixture model / trajectory training / inter-frame correlation / EM algorithm
Publication information
IEICE Technical Report, vol. 115, no. 523, SP2015-113, pp. 111-116, March 2016.
Report number
SP2015-113 |
Issue date
2016-03-21 (EA, SIP, SP) |
ISSN
Print edition: ISSN 0913-5685 / Online edition: ISSN 2432-6380
Copyright
The copyright of papers published in the Technical Report belongs to IEICE. (Permission numbers: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
PDF download
EA2015-85 SIP2015-134 SP2015-113 |
|