Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP |
2019-06-13 13:30 |
Kanagawa |
Tokyo Institute of Technology |
A study on style transplantation modeling techniques for DNN-based speech synthesis Yoshiki Hiruta (Tokyo Tech), Tomoki Koriyama (The Univ. of Tokyo), Yuuki Tachioka (Denso IT Lab), Takao Kobayashi (Tokyo Tech) SP2019-1 |
This paper investigates style transplantation modeling techniques for DNN-based statistical parametric speech synthesis.... [more] |
SP2019-1 pp.1-6 |
EA, SIP, SP |
2019-03-14 16:05 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
A Study on Speech Synthesis Based on Deep Gaussain Processes and Latent Variable Representation of Accent Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) EA2018-129 SIP2018-135 SP2018-91 |
[more] |
EA2018-129 SIP2018-135 SP2018-91 pp.179-184 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 10:50 |
Okinawa |
|
On the Use of Deep Gaussian Processes for GPR-based Speech Synthesis Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) EA2017-106 SIP2017-115 SP2017-89 |
This paper proposes a speech synthesis framework
based on deep Gaussian processes (DGPs).
DGP is a Bayesian deep learn... [more] |
EA2017-106 SIP2017-115 SP2017-89 pp.27-32 |
SP, ASJ-H |
2018-01-20 13:25 |
Tokyo |
The University of Tokyo |
A study on statistical speech synthesis based on GP-DNN hybrid model Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2017-67 |
We propose a novel approach to Gaussian process regression (GPR)-based speech synthesis
in this paper.
Since the conve... [more] |
SP2017-67 pp.5-10 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-22 13:00 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Invited Talk]
Expressive Speech Synthesis: Approaches to Text-to-Speech with Diverse Voices and Styles Takao Kobayashi (Tokyo Tech.) SP2017-64 |
As the performance of smart devices and information systems becomes higher, more advanced speech interfaces are requeste... [more] |
SP2017-64 pp.85-86 |
SP |
2016-01-14 10:30 |
Kanagawa |
Sunpian Kawasaki |
Performance evaluation of CRF/HMM-based automatic accent labeling for speech synthesis Rina Mashiko, Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2015-85 |
We have proposed an accent type and phrase boundary estimation technique using acoustic and language models represented ... [more] |
SP2015-85 pp.1-6 |
SP |
2015-08-21 15:50 |
Iwate |
Iwate Prefectural Univ. |
Performance Evaluation of Large-Scale Training Sentence Set Construction Based on Entropy in Statistical Speech Synthesis Takashi Nose (Tohoku Univ.), Yusuke Arao (DNP), Takao Kobayashi (Tokyo Tech), Komei Sugiura, Yoshinori Shiga (NICT) SP2015-57 |
This paper reports the evaluation results of training sentence set construction based on entropy that we previously prop... [more] |
SP2015-57 pp.39-44 |
SP, IPSJ-MUS |
2014-05-25 11:30 |
Tokyo |
|
A Kana Protocol Recommendation Method for Switch Input Speech Synthesis Systems Fuming Fang, Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2014-36 |
Switch-to-speech interface can provide a means of interactive speech communication as a support system
for people with ... [more] |
SP2014-36 pp.355-360 |
SP |
2014-01-23 16:30 |
Aichi |
Meijo Univ. |
A study on hyperparameter optimization for speech synthesis based on Gaussian process regression Tomoki Koriyama (Tokyo Inst. of Tech.), Takashi Nose (Tohoku Univ.), Takao Kobayashi (Tokyo Inst. of Tech.) SP2013-99 |
[more] |
SP2013-99 pp.19-24 |
SP, IPSJ-SLP |
2013-12-20 10:10 |
Tokyo |
|
Automatic Estimation of Accent Phrase Boundaries Using Language and Acoustic Models Hiroshi Suzuki, Tomoki Koriyama (Tokyo Tech), Takashi Nose (Tohoku Univ.), Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2013-89 |
This paper proposes a technique for automatically estimating accent phrase boundaries for text-to-speech synthesis syste... [more] |
SP2013-89 pp.97-102 |
SP, IPSJ-SLP |
2013-12-20 15:25 |
Tokyo |
|
[Fellow Memorial Lecture]
Toward Speech Synthesis with Diverse Voices and Styles: Approaches and Issues Takao Kobayashi (Tokyo Tech.) SP2013-93 |
Recently, hidden Markov model-based (HMM-based) speech synthesis has been widely studied in the text-to-speech (TTS) syn... [more] |
SP2013-93 pp.119-122 |
SP |
2013-01-31 14:15 |
Kyoto |
Doshisha Univ. |
A study on speaker-normalized style conversion for arbitrary speaker's expressive speech synthesis Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) SP2012-110 |
This paper proposes a technique for improving naturalness of synthetic speech using a framework of speaker adaptive trai... [more] |
SP2012-110 pp.73-78 |
SP |
2013-01-31 14:45 |
Kyoto |
Doshisha Univ. |
A Study on Style Control Based on Multiple-Regression HSMM for Synthesizing Singing Voices with Various Expressivity Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) SP2012-111 |
This paper proposes a style control technique based on multiple regression HSMM (MRHSMM)
for changing styles and their ... [more] |
SP2012-111 pp.79-84 |
SP |
2013-01-31 15:15 |
Kyoto |
Doshisha Univ. |
A Study on Multi-class Local Prosodic Context for Expressive Prosody Generation Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama (Tokyo Inst. of Tech.), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) SP2012-112 |
This paper describes a technique for reproducing local prosodic variability which appears in expressive speech including... [more] |
SP2012-112 pp.85-90 |
SP |
2012-11-08 15:15 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Modeling of local variance of spectral features and its application to parameter generation in HMM-based speech synthesis Takashi Nose, Vataya Chunwijitra, Takao Kobayashi (Tokyo Tech) SP2012-79 |
In this paper, we describe a technique for modeling local variance (LV)
of speech features and propose a novel paramete... [more] |
SP2012-79 pp.43-48 |
SP |
2012-06-14 11:00 |
Kanagawa |
NTT Atsugi R&D Center |
A Study on Automatic Prosodic Context Labeling for Emphatic Speech Synthesis Yu Maeno, Takashi Nose, Takao Kobayashi (Tokyo Tech), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) SP2012-33 |
This paper describes automatic prosodic context labeling of training data for synthesizing expressive speech in HMM-base... [more] |
SP2012-33 pp.1-6 |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 16:10 |
Tokyo |
|
On the use of prosodic-event-based HMM in F0 generation of conversational speech Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-53 SP2011-98 |
In this paper, we propose prosodic-event-based HMM
for effectively modeling F0 pattern of spontaneous conversational sp... [more] |
NLC2011-53 SP2011-98 pp.185-190 |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 16:35 |
Tokyo |
|
A Study on Speaker Independent Style Conversion in HMM Speech Synthesis Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-54 SP2011-99 |
This paper proposes a technique for synthesizing speech of a desired style using speaker-independent style conversion in... [more] |
NLC2011-54 SP2011-99 pp.191-196 |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 17:00 |
Tokyo |
|
A study on modeling phone duration using dynamic features for HMM-based speech synthesis Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-55 SP2011-100 |
This paper proposes a technique for modeling and generating phone durations
using their dynamic features to improve pre... [more] |
NLC2011-55 SP2011-100 pp.197-202 |
EA, SIP, SP |
2011-05-13 13:00 |
Osaka |
Ritsumeikan Univ. |
Performance evaluation of contexts for conversational speech synthesis using Corpus of Spontaneous Japanese Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) EA2011-27 SIP2011-27 SP2011-27 |
This paper proposes an extended context set for generating the prosodic variability of spontaneous speech in HMM-based c... [more] |
EA2011-27 SIP2011-27 SP2011-27 pp.155-160 |