Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-MUS, IPSJ-SLP |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
MS-Harmonic-Net++ vs SiFi-GAN: Comparison of fundamental frequency controllable fast neural waveform generative models. Sota Shimizu (Kobe Univ./NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ.), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ.), Tomoki Toda (Nagoya Univ./NICT), Hisashi Kawai (NICT) SP2023-5 |
Although Harmonic-Net+ has been proposed as a fundamental frequency (fo) and speech rate (SR) controllable fast neural v... |
SP2023-5 pp.20-25 |
SP, IPSJ-MUS, IPSJ-SLP |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Fast Neural Waveform Generation Model With Fully Connected Upsampling Haruki Yamashita (Kobe Univ/NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-15 |
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... |
SP2023-15 pp.73-78 |
SP, IPSJ-SLP, EA, SIP |
2023-02-28 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
MS-FC-HiFiGAN : Fast Neural Waveform Generation Model With Learnable Lightweight Upsampling Haruki Yamashita (Kobe Univ/NICT), Takuma Okamoto (NICT), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) EA2022-76 SIP2022-120 SP2022-40 |
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... |
EA2022-76 SIP2022-120 SP2022-40 pp.7-12 |
SP, IPSJ-MUS, IPSJ-SLP |
2022-06-18 10:50 |
Online |
Online |
[Invited Talk]
Crazy vocoder is unbreakable
-- But let's talk about an informal vision of the future -- Masanori Morise (Meiji Univ.) SP2022-15 |
When current speech synthesis researchers refer to Vocoder in their papers, they are most likely referring to Neural voc... |
SP2022-15 pp.61-66 |
SIS |
2021-03-04 09:00 |
Online |
Online |
Optimization source-filter based speech waveform generation using adversarial training Hayato Mitsui, Yosuke Sugiura, Nozomiko Yasui, Tetsuya Shimamura (Saitama Univ.) SIS2020-35 |
This research aims to improve the accuracy of the source-filter based speech waveform generation model using deep learni... |
SIS2020-35 pp.1-4 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
Data augmentation for ASR system by using locally time-reversed speech
-- Temporal inversion of feature sequence -- Takanori Ashihara, Tomohiro Tanaka, Takafumi Moriya, Ryo Masumura, Yusuke Shinohara, Makio Kashino (NTT) EA2019-110 SIP2019-112 SP2019-59 |
Data augmentation is one of the techniques to mitigate overfitting and improve robustness against several acoustic varia... |
EA2019-110 SIP2019-112 SP2019-59 pp.53-58 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
A Study for HMM-based embedded speech synthesis using a large-scale speech corpus Nobuyuki Nishizawa, Tomohiro Obara, Hiromi Ishizaki (KDDI Research, Inc.) EA2019-141 SIP2019-143 SP2019-90 |
This study shows that our speech synthesis system based on HMM speech synthesis for embedded devices can perform real-ti... |
EA2019-141 SIP2019-143 SP2019-90 pp.231-236 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) |
2019-12-06 10:35 |
Tokyo |
NHK Science & Technology Research Labs. |
[Invited Talk]
Progress and prospects of statistical speech synthesis Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-35 |
The basic problem of statistical speech synthesis is quite simple: we have a speech database for training, i.e., a set o... |
SP2019-35 pp.11-12 |
SP |
2016-01-14 15:35 |
Kanagawa |
Sunpian Kawasaki |
Pitch-synchronous band group delay vocoder for high quality speech synthesis Masatsune Tamura, Ryo Morinaka, Masahiro Morita (Toshiba) SP2015-91 |
This paper presents a speech analysis and synthesis method that can precisely synthesize speech waveforms for high quali... |
SP2015-91 pp.33-38 |
SP, IPSJ-MUS |
2014-05-25 11:30 |
Tokyo |
|
Speech waveform generation on subband domain Nobuyuki Nishizawa, Tsuneo Kato (KDDI R&D Labs) SP2014-35 |
To reduce the computational cost for waveform generation in speech synthesis based on analysis-synthesis systems like HM... |
SP2014-35 pp.349-354 |
SP, EA, SIP |
2013-05-17 09:45 |
Okayama |
|
Fast speech waveform generation using subband coding for speech synthesis Nobuyuki Nishizawa, Tsuneo Kato (KDDI Labs) EA2013-15 SIP2013-15 SP2013-15 |
For fast waveform generation in HMM-based speech synthesizers, a new method using a subband coding method that is also u... |
EA2013-15 SIP2013-15 SP2013-15 pp.85-90 |