Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA |
2024-05-22 14:55 |
Online |
Online |
Environmental sound synthesis and creation of dataset using vocal imitations Yuki Okamoto (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Shinnosuke Takamichi (The Univ. of Tokyo/Keio Univ.), Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) |
(To be available after the conference date) |
|
SIP, SP, EA, IPSJ-SLP |
2024-02-29 10:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Noise-Robust Voice Conversion by Denoising Training Conditioned with Latent Variables of Speech Quality and Recording Environment Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi (UT), Ryuichi Yamamoto, Kentaro Tachibana (LY), Hiroshi Saruwatari (UT) EA2023-63 SIP2023-110 SP2023-45 |
In this paper, we propose noise-robust voice conversion by conditioning latent variables representing speech quality and... |
EA2023-63 SIP2023-110 SP2023-45 pp.13-18 |
SIP, SP, EA, IPSJ-SLP |
2024-02-29 15:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
SRC4VC: Smartphone-Recorded Corpus for Benchmarking Multi-Speaker Voice Conversion Models Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi (UT), Ryuichi Yamamoto, Kentaro Tachibana (LY), Hiroshi Saruwatari (UT) |
|
|
SIP, SP, EA, IPSJ-SLP |
2024-02-29 15:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Preliminary Evaluation of Japanese Speech Corpus J-SpAW for Speaker Verification and Spoofing Detection Kota Kanno (Tokyo Metropolitan Univ.), Shinnosuke Takamichi (UTokyo), Sayaka Shiota (Tokyo Metropolitan Univ.) |
|
|
NLC, IPSJ-NL |
2023-03-18 16:40 |
Okinawa |
OIST (Primary: On-site, Secondary: Online) |
Collection of Textual Expressions in the Wild Toward Voice-quality Control from Free Description Aya Watanabe, Shinnosuke Takamichi, Yuki Saito, Hiroshi Saruwatari (UTokyo) NLC2022-29 |
|
NLC2022-29 pp.55-60 |
SP, IPSJ-SLP, EA, SIP |
2023-02-28 16:15 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound-source images Hien Ohnaka (NITTC), Shinnosuke Takamichi (UT), Keisuke Imoto (DU), Yuki Okamoto (Rits), Kazuki Fujii, Hiroshi Saruwatari (UT) EA2022-90 SIP2022-134 SP2022-54 |
(To be available after the conference date) |
EA2022-90 SIP2022-134 SP2022-54 pp.83-88 |
SP, IPSJ-SLP, EA, SIP |
2023-03-01 11:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Representation and Prediction of Accent Phrase Prosodic Features in Japanese Text-to-Speech Masaki Sato, Shinnosuke Takamichi, Hiroshi Saruwatari (The Univ. of Tokyo) EA2022-108 SIP2022-152 SP2022-72 |
In order to use speech synthesis in a variety of situations such as dialogue systems and emotional expression in audiobo... |
EA2022-108 SIP2022-152 SP2022-72 pp.197-202 |
SP, IPSJ-SLP, EA, SIP |
2023-03-01 14:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Corpus construction toward multi-domain empathetic dialogue speech synthesis Yuki Saito, Eiji Iimori, Shinnosuke Takamichi (UT), Kentaro Tachibana (LINE), Hiroshi Saruwatari (UT) |
(To be available after the conference date) |
|
EA, SIP, SP, IPSJ-SLP |
2022-03-02 10:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64 |
Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... |
EA2021-79 SIP2021-106 SP2021-64 pp.96-101 |
EA, SIP, SP, IPSJ-SLP |
2022-03-02 12:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluating the robustness of signal processing-based pseudonymization using parameter optimization against inversion attack. Hiroto Kai (Tokyo Metro. Univ.), Shinnosuke Takamichi (The Univ. of Tokyo), Sayaka Shiota, Hitoshi Kiya (Tokyo Metro. Univ.) EA2021-82 SIP2021-109 SP2021-67 |
|
EA2021-82 SIP2021-109 SP2021-67 pp.114-119 |
NLC, IPSJ-NL, SP, IPSJ-SLP |
2021-12-03 11:00 |
Online |
Online |
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47 |
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... |
NLC2021-26 SP2021-47 pp.42-47 |
EA, US, SP, SIP, IPSJ-SLP |
2021-03-03 14:05 |
Online |
Online |
[Poster Presentation]
End-to-end incremental TTS with lookahead generation with large pretrained language model Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2020-74 SIP2020-105 SP2020-39 |
(To be available after the conference date) |
EA2020-74 SIP2020-105 SP2020-39 pp.85-90 |
SP, IPSJ-MUS, IPSJ-SLP |
2020-06-07 15:45 |
Online |
Online |
HumanGAN: generative adversarial network with human-based discriminator and its naturalness evaluation in synthesized voice Kazuki Fujii (NITTC), Yuki Saito, Shinnosuke Takamichi (UTokyo), Yukino Baba (UTsukuba), Hiroshi Saruwatari (UTokyo) SP2020-6 |
|
SP2020-6 pp.15-20 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61 |
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... |
EA2019-112 SIP2019-114 SP2019-61 pp.65-70 |
SP |
2019-06-13 15:25 |
Kanagawa |
Tokyo Institute of Technology |
[Invited Talk]
Constructing voice corpus for next-generation speech research Shinnosuke Takamichi (UTokyo) SP2019-5 |
Thanks to developments of machine learning techniques including deep learning, solving more diverse issues is required i... |
SP2019-5 p.25 |
EA, ASJ-H, EMM, IPSJ-MUS |
2018-11-21 13:30 |
Ishikawa |
Hotel Koshuen |
Evaluation of DNN-based Low-Musical-Noise Speech Enhancement Using Kurtosis Matching Satoshi Mizoguchi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2018-66 EMM2018-66 |
This paper proposes DNN-based speech enhancement with low musical noise by kurtosis matching. Musical noise, artifacts g... |
EA2018-66 EMM2018-66 pp.19-24 |
SIP, EA, SP, MI (Joint) |
2018-03-19 10:00 |
Okinawa |
|
Experimental Evaluation of Multichannel Audio Source Separation Based on IDLMA Daichi Kitamura, Hayato Sumino, Norihiro Takamune, Shinnosuke Takamichi, Hiroshi Saruwatari (Univ. of Tokyo), Nobutaka Ono (Tokyo Metropolitan Univ.) EA2017-104 SIP2017-113 SP2017-87 |
In this paper, we propose a new informed multichannel audio source separation called independent deeply learned matrix a... |
EA2017-104 SIP2017-113 SP2017-87 pp.13-20 |
SIP, EA, SP, MI (Joint) |
2018-03-19 10:25 |
Okinawa |
|
Non-parallel and Many-to-Many Voice Conversion Using Variational Autoencoder Conditioned by Phonetic Posteriorgrams and d-vectors Yuki Saito (NTT/Univ. of Tokyo), Yusuke Ijima, Kyosuke Nishida (NTT), Shinnosuke Takamichi (Univ. of Tokyo) EA2017-105 SIP2017-114 SP2017-88 |
This paper proposes novel frameworks for non-parallel and many-to-many voice conversion (VC) using variational autoencod... |
EA2017-105 SIP2017-114 SP2017-88 pp.21-26 |
SP, IPSJ-SLP (Joint) |
2017-07-27 16:15 |
Miyagi |
Akiu Resort Hotel Crescent |
Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities and Evaluation of Dual Learning Hiroyuki Miyoshi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (Univ. of Tokyo) SP2017-17 |
Voice conversion (VC) using sequence-to-sequence learning of context posterior probabilities is proposed. Conventional V... |
SP2017-17 pp.9-14 |
EA, US (Joint) |
2017-01-25 13:00 |
Kyoto |
Doshisha Univ. |
[Poster Presentation]
Study on efficient solver for independent low-rank matrix analysis with sparse time-series-activity regularization Yoshiki Mitsui (Univ. Tokyo), Daichi Kitamura (SOKENDAI), Shinnosuke Takamichi (Univ. Tokyo), Nobutaka Ono (NII/SOKENDAI), Hiroshi Saruwatari (Univ. Tokyo) EA2016-72 |
In this paper, we propose a new blind source separation (BSS) method based on independent low-rank matrix analysis (ILRM... |
EA2016-72 pp.25-30 |