Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, EMM, ASJ-H |
2022-11-22 13:00 |
Online |
Online |
[Fellow Memorial Lecture]
Security and Privacy Preservation for Speech Signal
-- Approach from speech information hiding technology -- Masashi Unoki (JAIST) EA2022-60 EMM2022-60 |
Non-authentic but skillfully fabricated artificial replicas of authentic media in the real world are known as “media clo... [more] |
EA2022-60 EMM2022-60 pp.99-104 |
CCS |
2022-11-18 09:00 |
Mie |
(Primary: On-site, Secondary: Online) |
Voice Quality Conversion by Two-Step Process of Speech Feature Extraction and Speaker-Controlled Speech Synthesis Taichi Fukawa, Kenya Jin'no (Tokyo City Univ.) CCS2022-52 |
Many methods have been proposed in the field of voice quality conversion that use a style-transforming autoencoder. Howe... [more] |
CCS2022-52 pp.47-52 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Study of End-to-End Text-to-Speech that can seamlessly control speaker's individuality by Manipulating Speaker features Naoki Aotani, Sunao Hara, Msanobu Abe (Okayama Univ) SP2022-14 |
In this paper, we investigate an End-to-End speech synthesis scheme that enables to seamlessly control speaker individua... [more] |
SP2022-14 pp.55-60 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 10:50 |
Online |
Online |
[Invited Talk]
Crazy vocoder is unbreakable
-- But let's talk about an informal vision of the future -- Masanori Morise (Meiji Univ.) SP2022-15 |
When current speech synthesis researchers refer to Vocoder in their papers, they are most likely referring to Neural voc... [more] |
SP2022-15 pp.61-66 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 15:00 |
Online |
Online |
[Poster Presentation]
Worker Filtering Criteria for Subjective Evaluation of Synthesized Voice Sound Quality Using Crowdsourcing Moe Yaegashi (Waseda Univ.), Susumu Saito, Teppei Nakano (Waseda Univ./ifLab.), Tetsuji Ogawa (Waseda Univ.) SP2022-24 |
We investigate the effect of filtering criteria of crowdworkers on the subjective evaluation results of synthesized voi... [more] |
SP2022-24 pp.104-109 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 10:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64 |
Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... [more] |
EA2021-79 SIP2021-106 SP2021-64 pp.96-101 |
EA, US (Joint) |
2021-12-22 13:30 |
Kumamoto |
Sojo University |
[Poster Presentation]
Improved voice quality due to multi-speaker learning with WaveNet vocoder Satoshi Yoshida, Shingo Uenohara, Ken'ichi Furuya (Oita Univ.) EA2021-57 |
In recent years, speech synthesis and voice quality conversion techniques using neural networks have attracted much atte... [more] |
EA2021-57 pp.1-6 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-02 15:20 |
Online |
Online |
improvement of multilingual speech emotion recognition by normalizing features using CRNN Jinhai Qi, Motoyuki Suzuki (OIT) NLC2021-22 SP2021-43 |
In this research, a new multilingual emotion recognition method by normalizing features using CRNN has been proposed. We... [more] |
NLC2021-22 SP2021-43 pp.22-26 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-03 11:00 |
Online |
Online |
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47 |
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] |
NLC2021-26 SP2021-47 pp.42-47 |
SP, WIT, IPSJ-SLP, ASJ-H [detail] |
2021-10-19 15:10 |
Online |
Online |
A study on model training for DNN-HSMM-based speech synthesis using a large-scale speech corpus Nobuyuki Nishizawa, Gen Hattori (KDDI Research) SP2021-34 WIT2021-27 |
In this study, an investigation into model training for DNN-HSMM-based speech synthesis using a large speech corpus coll... [more] |
SP2021-34 WIT2021-27 pp.52-57 |
EA, ASJ-H |
2021-07-16 11:05 |
Online |
Online |
A study on the online speech data collection for speech synthesis Yuya Hoshiko, Naofumi Aoki, Kosei Ozeki, Yoshinori Dobashi (Hokkaido Univ.) EA2021-15 |
There are high expectations for text-to-speech systems for people who are unable to speak with their own voice, such as ... [more] |
EA2021-15 pp.72-74 |
EA, ASJ-H |
2021-07-16 11:30 |
Online |
Online |
A study on the number of speech samples required for making acoustic models in tailor-made speech synthesis Keigo Narita, Naofumi Aoki, Atsuhito Udo, Yoshinori Dobashi (Hokkaido Univ.) EA2021-16 |
In this study, we created speaker dependent acoustic models with varying numbers of samples, and confirmed differences i... [more] |
EA2021-16 pp.75-76 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 09:30 |
Online |
Online |
[Invited Talk]
Toward a Unification of Various Speech Processing Tasks Based on End-to-End Neural networks Shinji Watanabe (CMU) SP2021-8 |
This presentation will introduce the recent progress of speech processing technologies based on end-to-end neural networ... [more] |
SP2021-8 p.38 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 13:00 |
Online |
Online |
Creating of Japanese Phoneme Balanced Sentences for Speech Synthesis Yuko Takai, Naofumi Aoki, Yoshinori Dobashi (Hokkaido Univ.) SP2021-9 |
When the loss of voice is inevitable due to pharyngectomy or other reasons, it has become possible to realizespeech synt... [more] |
SP2021-9 pp.39-41 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 13:00 |
Online |
Online |
A Study on Error Correction for Improving the Accuracy of Acoustic Models Saki Anazawa, Naofumi Aoki, Yoshinori Dobashi (Hokkaido Univ.) SP2021-12 |
People with ALS (amyotrophic lateral sclerosis) or dysarthria sometimes use their own voice for speech synthesis. In thi... [more] |
SP2021-12 pp.51-52 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Simulation of Body-conducted Speech and Synthesis of One's Own Voice with a Sound-proof Earmuff and Bone-conduction Microphones Chen Ruiyan, Nishimura Tazuko, Minematsu Nobuaki, Saito Daisuke (UTokyo) SP2021-15 |
When one hears his/her recorded voices for the first time, s/he is probably surprised and not rarely disappointed at the... [more] |
SP2021-15 pp.63-68 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Dynamic Display of Guidelines in Interactive Speech Synthesizer Daiki Goto (Hokkai Gakuen Univ.), Naofumi Aoki, Keisuke ai (Hokkaido Univ.), Kunitoshi Motoki (Hokkai Gakuen Univ.) SP2021-18 |
We are developing a speech synthesis system that can play sounds by interactive control, just like playing a musical ins... [more] |
SP2021-18 pp.80-84 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Preliminary study on synthesizing relaxing voices
-- from a perspective of recognized/evoked emotions and acoustic features -- Yuki Watanabe, Shuichi Sakamoto (Tohoku Univ.), Takayuki Hoshi, Yoshiki Nagatani, Manabu Nakano (Pixie Dust Technologies) SP2021-19 |
The goal of this study is to synthesize speech sound which induces relaxed emotion. As the preliminary study, we investi... [more] |
SP2021-19 pp.85-90 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Neural speech synthesis using local phrase dependency structure information Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura (NIST) SP2021-23 |
In order to synthesize Japanese speech with natural prosody, we introduce an end-to-end TTS with new prosodic symbol rep... [more] |
SP2021-23 pp.107-112 |
WIT |
2021-06-01 14:55 |
Online |
Online |
The relationship between speech rate and environmental noise in synthesized speech for easy listening of movie audio discription Takeya Naono, Sawako Nakajima, Kazutaka Mitobe (Akita Univ) WIT2021-8 |
In recent years, speech synthesis has been used for audio description of movies and videos, and there is a need to impro... [more] |
WIT2021-8 pp.38-42 |