Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of Automatic Speech Recognition for Deaf and Hard-of-Hearing People by Speaker Adaptation. Kaito Takahashi, Takahiro Kinouchi, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Akio Kobayashi (Yamato Univ.), Norihide Kitaoka (TUT) EA2023-102 SIP2023-149 SP2023-84 |
Communication between normal-hearing people and the deaf is generally used sign language, written communication, and spe... [more] |
EA2023-102 SIP2023-149 SP2023-84 pp.244-249 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Evaluation of multi-speaker text-to-speech synthesis using a corpus for speech recognition with x-vectors for various speech styles Koki Hida (Wakayama Univ/NICT), Takuma Okamoto (NICT), Ryuichi Nisimura (Wakayama Univ), Yamato Ohtani (NICT), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-25 |
We have implemented multi-speaker end-to-end text-to-speech synthesis based on JETS using x-vectors as speaker embedding... [more] |
SP2023-25 pp.125-130 |
PRMU, SP |
2018-06-29 11:00 |
Nagano |
|
Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) PRMU2018-31 SP2018-11 |
This paper proposes a speaker adaptation technique for speech synthesis based on deep neural networks (DNNs) using a str... [more] |
PRMU2018-31 SP2018-11 pp.53-58 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
Development of Speaker/Environment-Dependent Acoustic Model for Non-Audible Murmur Recognition Based on DNN Adaptation Seita Noda, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2017-56 |
In this research, we aim to improve the performance of non-audible murmur (NAM) recognition towards the development of s... [more] |
SP2017-56 pp.7-10 |
SP, SIP, EA |
2017-03-01 12:40 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
An investigation of speaker adaptation method for DNN-based speech synthesis using speaker codes Nobukatsu Hojo, Yusuke Ijima (NTT) EA2016-108 SIP2016-163 SP2016-103 |
In this work, we conducted objective evaluation experiments on the conventional speaker adaptation methods for DNN-based... [more] |
EA2016-108 SIP2016-163 SP2016-103 pp.147-152 |
SP |
2016-10-27 16:25 |
Shizuoka |
Shizuoka University. |
A DNN-based Text-to-Speech Synthesis System using Speaker, Gender and Age Codes Hieu Thi Luong (VNU - HCM - University of Science), Shinji Takaki (NII), SangJin Kim (Naver Labs), Junichi Yamagishi (NII) SP2016-48 |
(To be available after the conference date) [more] |
SP2016-48 pp.37-42 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Speaker adaptation using speaker-normalized DNN based on speaker codes Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-118 |
Recently, deep neural network (DNN) becomes one of the main streams of acoustic modeling for automatic speech recognitio... [more] |
SP2014-118 pp.105-110 |
SP |
2013-02-28 15:00 |
Aichi |
Daido University |
[Poster Presentation]
The effect of Speaker Adaptive Training on the naturalness of conversational speech synthesis and the controllability of paralinguistic information Shunsuke Takahashi, Hiroki Mori (Utsunomiya Univ.) SP2012-121 |
The Speaker Adaptive Training (SAT) was applied to the HMM-based dialogue speech synthesis based on the UU Database. Its... [more] |
SP2012-121 pp.25-26 |
SP |
2012-06-14 16:00 |
Kanagawa |
NTT Atsugi R&D Center |
Perceptual evaluation of synthesized speech reflecting "personalities" Minoru Tsuzaki (KCUA), Keiichi Tokuda (NITEC), Hisashi Kawai (KDDI R&D Labs), Yoshinori Shiga, Jinfu Ni (NICT), Keiichiro Oura, Sayaka Shiota (NITEC) SP2012-39 |
Perceptual evaluation tests were performed for talker selection methods in the application of the speaker adaptation fra... [more] |
SP2012-39 pp.33-38 |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 09:00 |
Tokyo |
|
Simultaneous application of speaker adaptation and noise mixture model estimation for noise suppression Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani (NTT) NLC2011-46 SP2011-91 |
In this paper, we propose a joint processing method for a model-based noise suppression that simultaneously achieves spe... [more] |
NLC2011-46 SP2011-91 pp.113-118 |
WIT, SP |
2011-10-07 11:00 |
Tokyo |
TFT Bldg. |
On an analysis of speaker dependency and speaker adaptation for the recognition of back-channeling expressions in dialog between an robot and persons with dementia Ken Sadohara, Hiroaki Kojima (AIST), Takuya Narita, Misato Nihei, Minoru Kamata (Univ. Tokyo), Shin-ichi Ohnaka, Yoshihiro Fujita (NEC), Rina Ishiwata, Takenobu Inoue (NRCD) SP2011-62 WIT2011-44 |
For the purpose of establishing a dialog between a robot and persons with dementia to remind information, this paper con... [more] |
SP2011-62 WIT2011-44 pp.61-65 |
SP, NLC |
2009-12-21 10:10 |
Tokyo |
Univ. of Tokyo |
Speaker Adaptation Using Nonlinear Spectral Transformation For Speech Recognition. Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2009-12 SP2009-76 |
This paper proposes a speaker adaptation technique using nonlinear spectral transform based on GMMs.
One of the most po... [more] |
NLC2009-12 SP2009-76 pp.1-6 |
SP |
2009-07-17 13:25 |
Fukushima |
|
Improvement of lecture speech recognition by using speaker-class models Takashi Ito, Yohei Okuyama, Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.), Masaki Kohda (professor emeritus at Yamagata Univ.) SP2009-42 |
This paper describes a new method based on speaker-class (SC) models in order to improve the performance of lecture spee... [more] |
SP2009-42 pp.7-12 |
SP |
2009-06-25 14:30 |
Hokkaido |
Clark Memorial Hall, Hokkaido Univ. |
A mean F0 speaker adaptation method for regression model-based F0 contour generation Hosana Kamiyama, Takahiro Shinozaki (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.), Sadaoki Furui (Tokyo Inst. of Tech.) SP2009-38 |
This paper proposes a new speaker adaptation method for the fundamental frequency ($F_0$) contour generation models base... [more] |
SP2009-38 pp.87-92 |
SP, NLC |
2008-12-09 14:10 |
Tokyo |
Waseda Univ. |
Acoustic Model Training Technique for Speech Recognition using Style Estimation with Multiple-Regression HMM Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2008-30 SP2008-85 |
We propose a technique for emotional speech recognition based on multiple-regression HMM (MRHMM). To achieve emotional s... [more] |
NLC2008-30 SP2008-85 pp.37-42 |
SP |
2008-10-23 17:30 |
Kumamoto |
Kumamoto Univ. |
An MRHSMM-based voice quality control technique for synthetic speech using speaker adaptation from average voice model Makoto Tachibana, Akifumi Kouno, Takashi Nose, Takao Kobayashi (Tokyo Tech) SP2008-63 |
This paper describes a technique for controlling voice quality of synthetic speech using multiple-regression hidden semi... [more] |
SP2008-63 pp.41-46 |
SP |
2008-07-17 - 2008-07-19 |
Iwate |
Iwate Prefectural Univ. |
Spontaneous Speech Recognition using Speaker-Class Model and Speaker Adaptation Tetsuo Kosaka (Yamagata Univ.) SP2008-47 |
This paper describes a new method based on speaker-class models in order to improve the performance of an unsupervised b... [more] |
SP2008-47 pp.25-30 |
WIT, TL, SP |
2008-01-25 17:10 |
Shimane |
Shimane Univ. |
The consideration between number of phone in training sets and recognition accuracy for speaker adaptation. Shougo Matsuura, Jin'ichi Murakami, Satoru Ikehara (Tottori Univ.) TL2007-87 SP2007-182 WIT2007-87 |
This study pay attention to number of phones in training sets. We make
a hybrid HMM which mixed speaker adaptation HMM... [more] |
TL2007-87 SP2007-182 WIT2007-87 pp.87-91 |
SP |
2007-11-28 |
Chiba |
Chiba Institute of Technology |
Speech synthesis with diverse voices and styles using average voice model Makoto Tachibana, Takao Kobayashi (Tokyo Tech) SP2007-87 |
This paper describes a technique for synthesizing speech with desired style of an arbitrary target speaker's voice in a ... [more] |
SP2007-87 pp.7-12 |
SP |
2007-10-25 - 2007-10-26 |
Nagasaki |
Nagasaki University |
A speaker adaptation technique using average voice model for MRHSMM-based style control of synthetic speech Shinsuke Izawa, Makoto Tachibana, Takashi Nose, Takao Kobayashi (Tokyo Tech) SP2007-85 |
This paper describes a technique for synthesizing speech with an arbitrary target speaker’s voice as well as desired sty... [more] |
SP2007-85 pp.81-86 |