Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
WIT, SP, IPSJ-SLP [detail] |
2020-10-22 13:00 |
Online |
Online |
[Invited Talk]
NHK's activities on Japanese end-to-end speech synthesis Kiyoshi Kurihara (NHK) SP2020-11 WIT2020-12 |
The main business of NHK (Japan Broadcasting Corporation) is the production and broadcasting of programs. Many programs ... [more] |
SP2020-11 WIT2020-12 pp.19-20 |
SP |
2020-01-29 11:30 |
Toyama |
|
Application of Deep Gaussian Process to Multi-Speaker Text-to-Speech Synthesis using Speaker Codes Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari (UTokyo) SP2019-49 |
Speaker codes are widely used to achieve multi-speaker text-to-speech synthesis.
Conventionally, Deep Neural Network (D... [more] |
SP2019-49 pp.31-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2019-12-06 13:55 |
Tokyo |
NHK Science & Technology Research Labs. |
[Poster Presentation]
Effectiveness of sequence-to-sequence acoustic modeling by using automatic generated labels Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano (NHK) SP2019-37 |
We have proposed a method that uses yomigana (Japanese character readings) and prosodic symbols as input for sequence-to... [more] |
SP2019-37 pp.49-54 |
SP |
2019-01-27 10:40 |
Ishikawa |
Kanazawa-Harmonie |
Evaluation of end-to-end speech synthesis method using speaking styles Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano, Atsushi Imai (NHK) SP2018-58 |
The purpose of this study was to conduct end-to-end text-to-speech synthesis in Japanese; we developed a system that use... [more] |
SP2018-58 pp.29-34 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2018-12-10 16:30 |
Tokyo |
Waseda Univ. Nishiwaseda Campus |
Evaluation of Japanese end-to-end speech synthesis method inputting kana and prosodic symbols Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano, Atsushi Imai (NHK) SP2018-49 |
The purpose of this study was to conduct end-to-end text-to-speech synthesis in Japanese; we developed a system that use... [more] |
SP2018-49 pp.89-94 |
PRMU, SP |
2018-06-29 11:00 |
Nagano |
|
Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) PRMU2018-31 SP2018-11 |
This paper proposes a speaker adaptation technique for speech synthesis based on deep neural networks (DNNs) using a str... [more] |
PRMU2018-31 SP2018-11 pp.53-58 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 10:50 |
Okinawa |
|
On the Use of Deep Gaussian Processes for GPR-based Speech Synthesis Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) EA2017-106 SIP2017-115 SP2017-89 |
This paper proposes a speech synthesis framework
based on deep Gaussian processes (DGPs).
DGP is a Bayesian deep learn... [more] |
EA2017-106 SIP2017-115 SP2017-89 pp.27-32 |
SP, ASJ-H |
2018-01-20 13:25 |
Tokyo |
The University of Tokyo |
A study on statistical speech synthesis based on GP-DNN hybrid model Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2017-67 |
We propose a novel approach to Gaussian process regression (GPR)-based speech synthesis
in this paper.
Since the conve... [more] |
SP2017-67 pp.5-10 |
SP, ASJ-H |
2018-01-20 14:55 |
Tokyo |
The University of Tokyo |
[Poster Presentation]
TRAJECTORY TRAINING CONSIDERING POWER FOR SPEECH SYNTHESIS BASED ON NEURAL NETWORKS Ryohei Funato, Kei Hashimoto, keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2017-74 |
In statistical parametric speech synthesis, a relation between acoustic features and linguistic features is modeled by s... [more] |
SP2017-74 pp.43-48 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-22 13:00 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Invited Talk]
Expressive Speech Synthesis: Approaches to Text-to-Speech with Diverse Voices and Styles Takao Kobayashi (Tokyo Tech.) SP2017-64 |
As the performance of smart devices and information systems becomes higher, more advanced speech interfaces are requeste... [more] |
SP2017-64 pp.85-86 |
SP, SIP, EA |
2017-03-01 12:40 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Prosodic Word Embeddings for DNN-based speech synthesis Yusuke Ijima, Nobukatsu Hojo, Ryo Masumura, Taichi Asami (NTT) EA2016-109 SIP2016-164 SP2016-104 |
This paper proposed a novel word embeddings with prosodic information (prosodic word embeddings) for DNN-based speech sy... [more] |
EA2016-109 SIP2016-164 SP2016-104 pp.153-158 |
SP |
2017-01-21 11:00 |
Tokyo |
The University of Tokyo |
[Poster Presentation]
Designing linguistic features for expressive speech synthesis using audiobooks Chiaki Asai, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2016-70 |
In order to synthesize expressive speech, various statistical parametric speech synthesis systems have been proposed. Sp... [more] |
SP2016-70 pp.35-40 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 16:40 |
Tokyo |
NTT Musashino R&D |
Generative Adversarial Network-based Postfiltering for Statistical Parametric Speech Synthesis Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino (NTT) SP2016-61 |
In the field of speech synthesis, statistical parametric speech synthesis has been widely used due to the flexibility an... [more] |
SP2016-61 pp.89-94 |
SP |
2016-01-14 15:10 |
Kanagawa |
Sunpian Kawasaki |
Objective evaluation of synthetic speech using association between dimensions within spectral features Yusuke Ijima, Taichi Asami (NTT), Hideyuki Mizuno (TUSS) SP2015-90 |
This paper proposes a novel objective evaluation technique for statistical parametric speech synthesis. A novel point of... [more] |
SP2015-90 pp.27-32 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-03 09:00 |
Aichi |
Nagoya Inst of Tech. |
Evaluation of text-to-speech system construction for unknown-pronunciation languages Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2015-80 |
This paper discusses a method to construction of text-to-speech (TTS) systems for unknown-pronunciation languages. There... [more] |
SP2015-80 pp.93-98 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-03 09:25 |
Aichi |
Nagoya Inst of Tech. |
Deep Auto-encoder based Low-dimensional Feature Extraction using FFT Spectral Envelopes in Statistical Parametric Speech Synthesis Shinji Takaki, Junichi Yamagishi (NII) SP2015-81 |
In the state-of-the-art statistical parametric speech synthesis system, a speech analysis module, e.g. STRAIGHT spectral... [more] |
SP2015-81 pp.99-104 |
SP, IPSJ-SLP (Joint) |
2015-07-17 09:30 |
Nagano |
Katakura Suwako Hotel |
Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis Shinji Takaki (NII), SangJin Kim (Naver Labs), Junichi Yamagishi (NII), JongJin Kim (Naver Labs) SP2015-44 |
In this paper, we investigate a combination of several feed-forward deep neural networks (DNNs) for a high-quality stati... [more] |
SP2015-44 pp.49-54 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 14:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Invited Talk]
Statistical approach to flexible speech synthesis
-- towards human-like talking machines -- Keiichi Tokuda (NITech/Google) SP2014-109 |
This talk will give an overview of statistical approach to
flexible speech synthesis. For constructing human-like
tal... [more] |
SP2014-109 p.31 |