ken-system: - All Technical Committee Conferences

IEICE Technical Committee Submission System
Conference Schedule

Online Proceedings
[Sign in]
Tech. Rep. Archives

[Japanese] / [English]

(

Committee/Place/Topics

) --Press->

(

Paper Keywords: / Column:Title Auth. Affi. Abst. Keyword

) --Press->

All Technical Committee Conferences (Searched in: All Years)

Search Results: Conference Papers

Conference Papers (Available on Advance Programs) (Sort by: Date Descending)

Committee	Date Time	Place		Paper Title / Authors	Abstract	Paper #
EA, SIP, SP, IPSJ-SLP [detail]	2022-03-02 10:45	Okinawa	(Primary: On-site, Secondary: Online)	Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64	Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... [more]	EA2021-79 SIP2021-106 SP2021-64 pp.96-101
NLC, IPSJ-NL, SP, IPSJ-SLP [detail]	2021-12-03 11:00	Online	Online	Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47	In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more]	NLC2021-26 SP2021-47 pp.42-47
SP, EA, SIP	2020-03-02 13:00	Okinawa	Okinawa Industry Support Center (Cancelled but technical report was issued)	The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61	In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more]	EA2019-112 SIP2019-114 SP2019-61 pp.65-70
SP	2020-01-29 11:30	Toyama		Application of Deep Gaussian Process to Multi-Speaker Text-to-Speech Synthesis using Speaker Codes Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari (UTokyo) SP2019-49	Speaker codes are widely used to achieve multi-speaker text-to-speech synthesis. Conventionally, Deep Neural Network (D... [more]	SP2019-49 pp.31-36
SP	2019-06-13 13:30	Kanagawa	Tokyo Institute of Technology	A study on style transplantation modeling techniques for DNN-based speech synthesis Yoshiki Hiruta (Tokyo Tech), Tomoki Koriyama (The Univ. of Tokyo), Yuuki Tachioka (Denso IT Lab), Takao Kobayashi (Tokyo Tech) SP2019-1	This paper investigates style transplantation modeling techniques for DNN-based statistical parametric speech synthesis.... [more]	SP2019-1 pp.1-6
EA, SIP, SP	2019-03-14 16:05	Nagasaki	i+Land nagasaki (Nagasaki-shi)	A Study on Speech Synthesis Based on Deep Gaussain Processes and Latent Variable Representation of Accent Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) EA2018-129 SIP2018-135 SP2018-91	[more]	EA2018-129 SIP2018-135 SP2018-91 pp.179-184
SIP, EA, SP, MI (Joint) [detail]	2018-03-19 10:50	Okinawa		On the Use of Deep Gaussian Processes for GPR-based Speech Synthesis Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) EA2017-106 SIP2017-115 SP2017-89	This paper proposes a speech synthesis framework based on deep Gaussian processes (DGPs). DGP is a Bayesian deep learn... [more]	EA2017-106 SIP2017-115 SP2017-89 pp.27-32
SP, ASJ-H	2018-01-20 13:25	Tokyo	The University of Tokyo	A study on statistical speech synthesis based on GP-DNN hybrid model Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2017-67	We propose a novel approach to Gaussian process regression (GPR)-based speech synthesis in this paper. Since the conve... [more]	SP2017-67 pp.5-10
SP	2016-01-14 10:30	Kanagawa	Sunpian Kawasaki	Performance evaluation of CRF/HMM-based automatic accent labeling for speech synthesis Rina Mashiko, Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2015-85	We have proposed an accent type and phrase boundary estimation technique using acoustic and language models represented ... [more]	SP2015-85 pp.1-6
SP	2014-01-23 16:30	Aichi	Meijo Univ.	A study on hyperparameter optimization for speech synthesis based on Gaussian process regression Tomoki Koriyama (Tokyo Inst. of Tech.), Takashi Nose (Tohoku Univ.), Takao Kobayashi (Tokyo Inst. of Tech.) SP2013-99	[more]	SP2013-99 pp.19-24
SP, IPSJ-SLP	2013-12-20 10:10	Tokyo		Automatic Estimation of Accent Phrase Boundaries Using Language and Acoustic Models Hiroshi Suzuki, Tomoki Koriyama (Tokyo Tech), Takashi Nose (Tohoku Univ.), Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2013-89	This paper proposes a technique for automatically estimating accent phrase boundaries for text-to-speech synthesis syste... [more]	SP2013-89 pp.97-102
SP	2013-01-31 14:45	Kyoto	Doshisha Univ.	A Study on Style Control Based on Multiple-Regression HSMM for Synthesizing Singing Voices with Various Expressivity Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) SP2012-111	This paper proposes a style control technique based on multiple regression HSMM (MRHSMM) for changing styles and their ... [more]	SP2012-111 pp.79-84
SP	2013-01-31 15:15	Kyoto	Doshisha Univ.	A Study on Multi-class Local Prosodic Context for Expressive Prosody Generation Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama (Tokyo Inst. of Tech.), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) SP2012-112	This paper describes a technique for reproducing local prosodic variability which appears in expressive speech including... [more]	SP2012-112 pp.85-90
SP, NLC, IPSJ-SLP [detail]	2011-12-20 16:10	Tokyo		On the use of prosodic-event-based HMM in F0 generation of conversational speech Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-53 SP2011-98	In this paper, we propose prosodic-event-based HMM for effectively modeling F0 pattern of spontaneous conversational sp... [more]	NLC2011-53 SP2011-98 pp.185-190
EA, SIP, SP	2011-05-13 13:00	Osaka	Ritsumeikan Univ.	Performance evaluation of contexts for conversational speech synthesis using Corpus of Spontaneous Japanese Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) EA2011-27 SIP2011-27 SP2011-27	This paper proposes an extended context set for generating the prosodic variability of spontaneous speech in HMM-based c... [more]	EA2011-27 SIP2011-27 SP2011-27 pp.155-160
PRMU, SP, MVE, CQ	2010-01-21 11:10	Kyoto	Kyoto Univ.	A study on Conversational Speech Synthesis Based on Average Voice Model Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) CQ2009-61 PRMU2009-160 SP2009-101 MVE2009-83	[more]	CQ2009-61 PRMU2009-160 SP2009-101 MVE2009-83 pp.33-38

Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)

[Return to Top Page]

[Return to IEICE Web Page]

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan