Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 10:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64 |
Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... [more] |
EA2021-79 SIP2021-106 SP2021-64 pp.96-101 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-03 11:00 |
Online |
Online |
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47 |
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] |
NLC2021-26 SP2021-47 pp.42-47 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61 |
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more] |
EA2019-112 SIP2019-114 SP2019-61 pp.65-70 |
SP |
2020-01-29 11:30 |
Toyama |
|
Application of Deep Gaussian Process to Multi-Speaker Text-to-Speech Synthesis using Speaker Codes Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari (UTokyo) SP2019-49 |
Speaker codes are widely used to achieve multi-speaker text-to-speech synthesis.
Conventionally, Deep Neural Network (D... [more] |
SP2019-49 pp.31-36 |
SP |
2019-06-13 13:30 |
Kanagawa |
Tokyo Institute of Technology |
A study on style transplantation modeling techniques for DNN-based speech synthesis Yoshiki Hiruta (Tokyo Tech), Tomoki Koriyama (The Univ. of Tokyo), Yuuki Tachioka (Denso IT Lab), Takao Kobayashi (Tokyo Tech) SP2019-1 |
This paper investigates style transplantation modeling techniques for DNN-based statistical parametric speech synthesis.... [more] |
SP2019-1 pp.1-6 |
EA, SIP, SP |
2019-03-14 16:05 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
A Study on Speech Synthesis Based on Deep Gaussain Processes and Latent Variable Representation of Accent Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) EA2018-129 SIP2018-135 SP2018-91 |
[more] |
EA2018-129 SIP2018-135 SP2018-91 pp.179-184 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 10:50 |
Okinawa |
|
On the Use of Deep Gaussian Processes for GPR-based Speech Synthesis Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) EA2017-106 SIP2017-115 SP2017-89 |
This paper proposes a speech synthesis framework
based on deep Gaussian processes (DGPs).
DGP is a Bayesian deep learn... [more] |
EA2017-106 SIP2017-115 SP2017-89 pp.27-32 |
SP, ASJ-H |
2018-01-20 13:25 |
Tokyo |
The University of Tokyo |
A study on statistical speech synthesis based on GP-DNN hybrid model Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2017-67 |
We propose a novel approach to Gaussian process regression (GPR)-based speech synthesis
in this paper.
Since the conve... [more] |
SP2017-67 pp.5-10 |
SP |
2016-01-14 10:30 |
Kanagawa |
Sunpian Kawasaki |
Performance evaluation of CRF/HMM-based automatic accent labeling for speech synthesis Rina Mashiko, Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2015-85 |
We have proposed an accent type and phrase boundary estimation technique using acoustic and language models represented ... [more] |
SP2015-85 pp.1-6 |
SP |
2014-01-23 16:30 |
Aichi |
Meijo Univ. |
A study on hyperparameter optimization for speech synthesis based on Gaussian process regression Tomoki Koriyama (Tokyo Inst. of Tech.), Takashi Nose (Tohoku Univ.), Takao Kobayashi (Tokyo Inst. of Tech.) SP2013-99 |
[more] |
SP2013-99 pp.19-24 |
SP, IPSJ-SLP |
2013-12-20 10:10 |
Tokyo |
|
Automatic Estimation of Accent Phrase Boundaries Using Language and Acoustic Models Hiroshi Suzuki, Tomoki Koriyama (Tokyo Tech), Takashi Nose (Tohoku Univ.), Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2013-89 |
This paper proposes a technique for automatically estimating accent phrase boundaries for text-to-speech synthesis syste... [more] |
SP2013-89 pp.97-102 |
SP |
2013-01-31 14:45 |
Kyoto |
Doshisha Univ. |
A Study on Style Control Based on Multiple-Regression HSMM for Synthesizing Singing Voices with Various Expressivity Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) SP2012-111 |
This paper proposes a style control technique based on multiple regression HSMM (MRHSMM)
for changing styles and their ... [more] |
SP2012-111 pp.79-84 |
SP |
2013-01-31 15:15 |
Kyoto |
Doshisha Univ. |
A Study on Multi-class Local Prosodic Context for Expressive Prosody Generation Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama (Tokyo Inst. of Tech.), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) SP2012-112 |
This paper describes a technique for reproducing local prosodic variability which appears in expressive speech including... [more] |
SP2012-112 pp.85-90 |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 16:10 |
Tokyo |
|
On the use of prosodic-event-based HMM in F0 generation of conversational speech Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-53 SP2011-98 |
In this paper, we propose prosodic-event-based HMM
for effectively modeling F0 pattern of spontaneous conversational sp... [more] |
NLC2011-53 SP2011-98 pp.185-190 |
EA, SIP, SP |
2011-05-13 13:00 |
Osaka |
Ritsumeikan Univ. |
Performance evaluation of contexts for conversational speech synthesis using Corpus of Spontaneous Japanese Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) EA2011-27 SIP2011-27 SP2011-27 |
This paper proposes an extended context set for generating the prosodic variability of spontaneous speech in HMM-based c... [more] |
EA2011-27 SIP2011-27 SP2011-27 pp.155-160 |
PRMU, SP, MVE, CQ |
2010-01-21 11:10 |
Kyoto |
Kyoto Univ. |
A study on Conversational Speech Synthesis Based on Average Voice Model Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) CQ2009-61 PRMU2009-160 SP2009-101 MVE2009-83 |
[more] |
CQ2009-61 PRMU2009-160 SP2009-101 MVE2009-83 pp.33-38 |