|
|
All Technical Committee Conferences (Searched in: All Years)
|
|
Search Results: Conference Papers |
Conference Papers (Available on Advance Programs) (Sort by: Date Descending) |
|
Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
MS-Harmonic-Net++ vs SiFi-GAN: Comparison of fundamental frequency controllable fast neural waveform generative models. Sota Shimizu (Kobe Univ./NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ.), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ.), Tomoki Toda (Nagoya Univ./NICT), Hisashi Kawai (NICT) SP2023-5 |
Although Harmonic-Net+ has been proposed as a fundamental frequency (fo) and speech rate (SR) controllable fast neural v... [more] |
SP2023-5 pp.20-25 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Fast Neural Waveform Generation Model With Fully Connected Upsampling Haruki Yamashita (Kobe cniv/NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-15 |
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... [more] |
SP2023-15 pp.73-78 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Evaluation of multi-speaker text-to-speech synthesis using a corpus for speech recognition with x-vectors for various speech styles Koki Hida (Wakayama Univ/NICT), Takuma Okamoto (NICT), Ryuichi Nisimura (Wakayama Univ), Yamato Ohtani (NICT), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-25 |
We have implemented multi-speaker end-to-end text-to-speech synthesis based on JETS using x-vectors as speaker embedding... [more] |
SP2023-25 pp.125-130 |
SP |
2014-11-13 14:35 |
Fukuoka |
Kyushu Univ. Chikushi Campus |
Shared emotion additive model for HMM-based emotional speech synthesis Yamato Ohtani, Yu Nasu, Ryo Morinaka, Masatsune Tamura, Masahiro Morita, Masami Akamine (Toshiba) SP2014-92 |
We have proposed an emotion addition method based on additive structure in HMM-based speech synthesis.
This technique ... [more] |
SP2014-92 pp.13-18 |
SP, IPSJ-MUS |
2014-05-25 11:30 |
Tokyo |
|
Statistical bandwidth extension using sub-band basis spectrum model Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine (Toshiba) SP2014-27 |
This paper describes a novel statistical bandwidth extension (BWE) method based on a Gaussian mixture model (GMM) and a ... [more] |
SP2014-27 pp.303-308 |
SP |
2009-01-30 15:05 |
Nara |
NAIST |
Many-to-many eigenvoice conversion algorithms with a reference speaker Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (Nara Inst. of Scie and Tech.) SP2008-140 |
In this paper, we propose many-to-many voice conversion (VC) technique to convert an arbitrary source speaker's voice in... [more] |
SP2008-140 pp.85-90 |
SP |
2009-01-30 15:30 |
Nara |
NAIST |
Low-delay voice conversion algorithm based on maximum likelihood estimation of spectral parameter trajectory Takashi Muramatsu, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (Nara Inst. of Scie and Tech.) SP2008-141 |
In this paper, we aim to achieve high-quality and real-time VC considering spectral conversion method and post-processin... [more] |
SP2008-141 pp.91-96 |
SP |
2007-10-25 - 2007-10-26 |
Nagasaki |
Nagasaki University |
Many-to-One Voice Conversion Algorithms with Pre-Stored Speaker Data Sets Daisuke Tani, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (NAIST) SP2007-81 |
This paper describes an evaluation of four many-to-one voice conversion (VC) algorithms converting an arbitrary speaker'... [more] |
SP2007-81 pp.61-66 |
SP |
2007-10-25 - 2007-10-26 |
Nagasaki |
Nagasaki University |
Evaluation of Voice Quality Control Based on One-to-Many Eigenvoice Conversion Kumi Ohta, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (NAIST) SP2007-82 |
This paper proposes techniques for flexibly controlling voice quality of converted speech from a particular source speak... [more] |
SP2007-82 pp.67-72 |
|
|
|
Copyright and reproduction :
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
|
[Return to Top Page]
[Return to IEICE Web Page]
|