EMM 2022-01-27
Online
Speech Watermarking Approach for Securing Speaker Anonymization using McAdams Coefficients
Candy Olivia Mawalim, Masashi Unoki (JAIST) EMM2021-88
SeMI 2022-01-21
(Primary: On-site, Secondary: Online)
[Short Paper] Dementia Detection Using Two Perplexities Methods with Part-of-Speech Tags
Chuheng Zheng, Mondher Bouazizi, Tomoaki Ohtsuki (Keio Univ.) SeMI2021-77
Alzheimer’s disease is a kind of dementia that causes problems with memory, thinking, and behavior. Using automated comp...
Kumamoto Sojo University
[Poster Presentation] Improved voice quality due to multi-speaker learning with WaveNet vocoder
Satoshi Yoshida, Shingo Uenohara, Ken'ichi Furuya (Oita Univ.) EA2021-57
In recent years, speech synthesis and voice quality conversion techniques using neural networks have attracted much atte...
TL 2021-12-18
Online
Interactions among Speakers of English as a Lingua Franca in Collaborative Work with Plastic Blocks
Kazuhiro Takeuchi (OECU), Midori Tanimura (Ritsumeikan University), Etsuko Yoshida (Mie University) TL2021-28
This article presents a corpus we originally recorded, which contains ELF (English as a lingua franca) interactions in a...
WIT, HI-SIGACI 2021-12-09
Online
Significance of the publication of "Speech communication and people with disabilities"
Akira Ichikawa (Chiba Univ.), Yuji Nagashima (Kogakuin Univ.), Akira Okamoto (Tsukuba University of Technology), Naoto Kato (i Univ.), Shinji Sako (NITech), Tetsuya Takiguchi (Kobe Univ.), Daisuke Hara (Toyota Technological Institute), Michiru Makuuchi (National Rehabilitation Center For Persons with Disabilities) WIT2021-42
The book we authored, "Speech Communication and People with Disabilities," (edited by Acoustical Society of Japan, Acous...
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] 2021-12-02
Online
Improvement of multilingual speech emotion recognition by normalizing features using CRNN
Jinhai Qi, Motoyuki Suzuki (OIT) NLC2021-22 SP2021-43
In this research, a new multilingual emotion recognition method by normalizing features using CRNN has been proposed. We...
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] 2021-12-03
Online
An approach to voice conversion for manipulating emotion dimensions
Keita Mukada, Hiroki Mori (Utsunomiya Univ.) NLC2021-25 SP2021-46
We propose an emotional voice conversion method based on the emotion dimensions. Conventional emotional voice conversion...
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] 2021-12-03
Online
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut...
EMM, EA, ASJ-H 2021-11-15
Online
[Poster Presentation] Study on frame synchronization in spread-spectrum based speech information hiding method by using linear prediction residue
Takuto Isoyama, Masashi Unoki (JAIST) EA2021-40 EMM2021-67
Our previous study proposed a blindly-detectable direct-spread spectrum (DSS) method using linear prediction (LP) residu...
SP, WIT, IPSJ-SLP, ASJ-H [detail] 2021-10-19
Online
A study on model training for DNN-HSMM-based speech synthesis using a large-scale speech corpus
Nobuyuki Nishizawa, Gen Hattori (KDDI Research) SP2021-34 WIT2021-27
In this study, an investigation into model training for DNN-HSMM-based speech synthesis using a large speech corpus coll...
EA, ASJ-H 2021-07-15
Online
Acoustic characteristics of a face mask invented for the purpose of not impairing speech clarity
Hiroki Matsuzaki (HUS) EA2021-10
Due to the epidemic of the new coronavirus infection (COVID-19), wearing a mask is required in daily life to prevent the...
EA, ASJ-H 2021-07-15
Online
A production note of a hand-made vocal cords model
Naofumi Aoki (Hokkaido Univ.) EA2021-11
This study tries to develop a vocal cords model that may be employed as a teaching material explaining the mechanism of...
EA, ASJ-H 2021-07-16
Online
A study on the online speech data collection for speech synthesis
Yuya Hoshiko, Naofumi Aoki, Kosei Ozeki, Yoshinori Dobashi (Hokkaido Univ.) EA2021-15
There are high expectations for text-to-speech systems for people who are unable to speak with their own voice, such as ...
EA, ASJ-H 2021-07-16
Online
A study on the number of speech samples required for making acoustic models in tailor-made speech synthesis
Keigo Narita, Naofumi Aoki, Atsuhito Udo, Yoshinori Dobashi (Hokkaido Univ.) EA2021-16
In this study, we created speaker dependent acoustic models with varying numbers of samples, and confirmed differences i...
IMQ, HIP 2021-07-09
Online
IMQ2021-5 HIP2021-20
To achieve emotional intensity estimation, we use segment-based estimation in which the utterance is divided into severa...
SP, IPSJ-SLP, IPSJ-MUS 2021-06-18
Online
F0 estimation of speech based on l2-norm regularized TV-CAR analysis
Keiichi Funaki (Univ. of the Ryukyus) SP2021-2
Linear Prediction (LP) is the most successful speech analysis in speech processing, including speech coding implemented ...
SP, IPSJ-SLP, IPSJ-MUS 2021-06-18
Online
Protection method with audio processing against Audio Adversarial Example
Taisei Yamamoto, Yuya Tarutani, Yukinobu Fukusima, Tokumi Yokohira (Okayama Univ.) SP2021-4
Machine learning technology has improved the recognition accuracy of voice recognition, and demand for voice recognition...
SP, IPSJ-SLP, IPSJ-MUS 2021-06-18
Online
Speech Intelligibility Experiments using crowdsourcing -- from designing Web page to Data screening --
Ayako Yamamoto, Toshio Irino (Wakayama Univ.), Kenichi Arai, Shoko Araki, Atsunori Ogawa, Keisuke Kinoshita, Tomohiro Nakatani (NTT) SP2021-5
Many subjective experiments have been performed to develop objective speech intelligibility measures, but the novel coro...
SP, IPSJ-SLP, IPSJ-MUS 2021-06-19
Online
[Invited Talk] Toward a Unification of Various Speech Processing Tasks Based on End-to-End Neural networks
Shinji Watanabe (CMU) SP2021-8
This presentation will introduce the recent progress of speech processing technologies based on end-to-end neural networ...
SP, IPSJ-SLP, IPSJ-MUS 2021-06-19
Online
Creating of Japanese Phoneme Balanced Sentences for Speech Synthesis
Yuko Takai, Naofumi Aoki, Yoshinori Dobashi (Hokkaido Univ.) SP2021-9
When the loss of voice is inevitable due to pharyngectomy or other reasons, it has become possible to realize speech synt...
