ken-system: - All Technical Committee Conferences

IEICE Technical Committee Submission System
Conference Schedule

Online Proceedings
[Sign in]
Tech. Rep. Archives

[Japanese] / [English]

(

Committee/Place/Topics

) --Press->

(

Paper Keywords: / Column:Title Auth. Affi. Abst. Keyword

) --Press->

All Technical Committee Conferences (Searched in: All Years)

Search Results: Conference Papers

Conference Papers (Available on Advance Programs) (Sort by: Date Descending)

Committee	Date Time	Place		Paper Title / Authors	Abstract	Paper #
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	Evaluation of Automatic Speech Recognition for Deaf and Hard-of-Hearing People by Speaker Adaptation. Kaito Takahashi, Takahiro Kinouchi, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Akio Kobayashi (Yamato Univ.), Norihide Kitaoka (TUT) EA2023-102 SIP2023-149 SP2023-84	Communication between normal-hearing people and the deaf is generally used sign language, written communication, and spe... [more]	EA2023-102 SIP2023-149 SP2023-84 pp.244-249
SP, IPSJ-MUS, IPSJ-SLP [detail]	2023-06-24 13:50	Tokyo	(Primary: On-site, Secondary: Online)	Evaluation of multi-speaker text-to-speech synthesis using a corpus for speech recognition with x-vectors for various speech styles Koki Hida (Wakayama Univ/NICT), Takuma Okamoto (NICT), Ryuichi Nisimura (Wakayama Univ), Yamato Ohtani (NICT), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-25	We have implemented multi-speaker end-to-end text-to-speech synthesis based on JETS using x-vectors as speaker embedding... [more]	SP2023-25 pp.125-130
PRMU, SP	2018-06-29 11:00	Nagano		Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) PRMU2018-31 SP2018-11	This paper proposes a speaker adaptation technique for speech synthesis based on deep neural networks (DNNs) using a str... [more]	PRMU2018-31 SP2018-11 pp.53-58
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail]	2017-12-21 12:50	Tokyo	Waseda Univ. Green Computing Systems Research Organization	[Poster Presentation] Development of Speaker/Environment-Dependent Acoustic Model for Non-Audible Murmur Recognition Based on DNN Adaptation Seita Noda, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2017-56	In this research, we aim to improve the performance of non-audible murmur (NAM) recognition towards the development of s... [more]	SP2017-56 pp.7-10
SP, SIP, EA	2017-03-01 12:40	Okinawa	Okinawa Industry Support Center	[Poster Presentation] An investigation of speaker adaptation method for DNN-based speech synthesis using speaker codes Nobukatsu Hojo, Yusuke Ijima (NTT) EA2016-108 SIP2016-163 SP2016-103	In this work, we conducted objective evaluation experiments on the conventional speaker adaptation methods for DNN-based... [more]	EA2016-108 SIP2016-163 SP2016-103 pp.147-152
SP	2016-10-27 16:25	Shizuoka	Shizuoka University.	A DNN-based Text-to-Speech Synthesis System using Speaker, Gender and Age Codes Hieu Thi Luong (VNU - HCM - University of Science), Shinji Takaki (NII), SangJin Kim (Naver Labs), Junichi Yamagishi (NII) SP2016-48	(To be available after the conference date) [more]	SP2016-48 pp.37-42
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail]	2014-12-16 11:00	Kanagawa	Tokyo Institute of Technology (Suzukakedai Campus)	Speaker adaptation using speaker-normalized DNN based on speaker codes Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-118	Recently, deep neural network (DNN) becomes one of the main streams of acoustic modeling for automatic speech recognitio... [more]	SP2014-118 pp.105-110
SP	2013-02-28 15:00	Aichi	Daido University	[Poster Presentation] The effect of Speaker Adaptive Training on the naturalness of conversational speech synthesis and the controllability of paralinguistic information Shunsuke Takahashi, Hiroki Mori (Utsunomiya Univ.) SP2012-121	The Speaker Adaptive Training (SAT) was applied to the HMM-based dialogue speech synthesis based on the UU Database. Its... [more]	SP2012-121 pp.25-26
SP	2012-06-14 16:00	Kanagawa	NTT Atsugi R&D Center	Perceptual evaluation of synthesized speech reflecting "personalities" Minoru Tsuzaki (KCUA), Keiichi Tokuda (NITEC), Hisashi Kawai (KDDI R&D Labs), Yoshinori Shiga, Jinfu Ni (NICT), Keiichiro Oura, Sayaka Shiota (NITEC) SP2012-39	Perceptual evaluation tests were performed for talker selection methods in the application of the speaker adaptation fra... [more]	SP2012-39 pp.33-38
SP, NLC, IPSJ-SLP [detail]	2011-12-20 09:00	Tokyo		Simultaneous application of speaker adaptation and noise mixture model estimation for noise suppression Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani (NTT) NLC2011-46 SP2011-91	In this paper, we propose a joint processing method for a model-based noise suppression that simultaneously achieves spe... [more]	NLC2011-46 SP2011-91 pp.113-118
WIT, SP	2011-10-07 11:00	Tokyo	TFT Bldg.	On an analysis of speaker dependency and speaker adaptation for the recognition of back-channeling expressions in dialog between an robot and persons with dementia Ken Sadohara, Hiroaki Kojima (AIST), Takuya Narita, Misato Nihei, Minoru Kamata (Univ. Tokyo), Shin-ichi Ohnaka, Yoshihiro Fujita (NEC), Rina Ishiwata, Takenobu Inoue (NRCD) SP2011-62 WIT2011-44	For the purpose of establishing a dialog between a robot and persons with dementia to remind information, this paper con... [more]	SP2011-62 WIT2011-44 pp.61-65
SP, NLC	2009-12-21 10:10	Tokyo	Univ. of Tokyo	Speaker Adaptation Using Nonlinear Spectral Transformation For Speech Recognition. Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2009-12 SP2009-76	This paper proposes a speaker adaptation technique using nonlinear spectral transform based on GMMs. One of the most po... [more]	NLC2009-12 SP2009-76 pp.1-6
SP	2009-07-17 13:25	Fukushima		Improvement of lecture speech recognition by using speaker-class models Takashi Ito, Yohei Okuyama, Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.), Masaki Kohda (professor emeritus at Yamagata Univ.) SP2009-42	This paper describes a new method based on speaker-class (SC) models in order to improve the performance of lecture spee... [more]	SP2009-42 pp.7-12
SP	2009-06-25 14:30	Hokkaido	Clark Memorial Hall, Hokkaido Univ.	A mean F0 speaker adaptation method for regression model-based F0 contour generation Hosana Kamiyama, Takahiro Shinozaki (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.), Sadaoki Furui (Tokyo Inst. of Tech.) SP2009-38	This paper proposes a new speaker adaptation method for the fundamental frequency ($F_0$) contour generation models base... [more]	SP2009-38 pp.87-92
SP, NLC	2008-12-09 14:10	Tokyo	Waseda Univ.	Acoustic Model Training Technique for Speech Recognition using Style Estimation with Multiple-Regression HMM Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2008-30 SP2008-85	We propose a technique for emotional speech recognition based on multiple-regression HMM (MRHMM). To achieve emotional s... [more]	NLC2008-30 SP2008-85 pp.37-42
SP	2008-10-23 17:30	Kumamoto	Kumamoto Univ.	An MRHSMM-based voice quality control technique for synthetic speech using speaker adaptation from average voice model Makoto Tachibana, Akifumi Kouno, Takashi Nose, Takao Kobayashi (Tokyo Tech) SP2008-63	This paper describes a technique for controlling voice quality of synthetic speech using multiple-regression hidden semi... [more]	SP2008-63 pp.41-46
SP	2008-07-17 - 2008-07-19	Iwate	Iwate Prefectural Univ.	Spontaneous Speech Recognition using Speaker-Class Model and Speaker Adaptation Tetsuo Kosaka (Yamagata Univ.) SP2008-47	This paper describes a new method based on speaker-class models in order to improve the performance of an unsupervised b... [more]	SP2008-47 pp.25-30
WIT, TL, SP	2008-01-25 17:10	Shimane	Shimane Univ.	The consideration between number of phone in training sets and recognition accuracy for speaker adaptation. Shougo Matsuura, Jin'ichi Murakami, Satoru Ikehara (Tottori Univ.) TL2007-87 SP2007-182 WIT2007-87	This study pay attention to number of phones in training sets. We make a hybrid HMM which mixed speaker adaptation HMM... [more]	TL2007-87 SP2007-182 WIT2007-87 pp.87-91
SP	2007-11-28	Chiba	Chiba Institute of Technology	Speech synthesis with diverse voices and styles using average voice model Makoto Tachibana, Takao Kobayashi (Tokyo Tech) SP2007-87	This paper describes a technique for synthesizing speech with desired style of an arbitrary target speaker's voice in a ... [more]	SP2007-87 pp.7-12
SP	2007-10-25 - 2007-10-26	Nagasaki	Nagasaki University	A speaker adaptation technique using average voice model for MRHSMM-based style control of synthetic speech Shinsuke Izawa, Makoto Tachibana, Takashi Nose, Takao Kobayashi (Tokyo Tech) SP2007-85	This paper describes a technique for synthesizing speech with an arbitrary target speaker’s voice as well as desired sty... [more]	SP2007-85 pp.81-86

Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)

[Return to Top Page]

[Return to IEICE Web Page]

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan