Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
SELECTING N-LOWEST SCORES FOR TRAINING MOS PREDICTION MODELS Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko (NTT) EA2023-94 SIP2023-141 SP2023-76 |
Automatic speech quality assessment (SQA) is a task to evaluate the quality of speech samples without resorting to time-... [more] |
EA2023-94 SIP2023-141 SP2023-76 pp.196-201 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 16:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Simulation Evaluation of Speech Detection Based on Distributed Sound-to-Light Conversion Device Blinkies Satoshi Motoyama, Natsuki Ueno, Masahiro Yasuda (TMU), Yuma Kinoshita (Tokai Univ.), Nobutaka Ono (TMU) EA2023-126 SIP2023-173 SP2023-108 |
The purpose of this study is speech detection using the distributed sound-to-light conversion device Blinkies. As an ini... [more] |
EA2023-126 SIP2023-173 SP2023-108 pp.382-387 |
HCGSYMPO (2nd) |
2023-12-11 - 2023-12-13 |
Fukuoka |
Asia pacific Import Mart (Kitakyushu) (Primary: On-site, Secondary: Online) |
Investigation of audio feedback method for speakers in speech rate converted conversation.
-- Evaluation of presented sound to understand the listening completion point of listener. -- Yudai Ishikawa, Hiroto Saito (Tokyo Denki Univ.) |
This study focuses on the time difference between the speaker and the listener that occurs when speech-rate conversion i... [more] |
|
HCS, CNR |
2023-11-05 16:35 |
Tokyo |
Kogakuin University (Primary: On-site, Secondary: Online) |
Evaluation of Ease of Hearing for Speech Rate Converted Speech with Constant Speech Rate Miou Oyama, Hiroto Saito (Tokyo Denki Univ.) CNR2023-18 HCS2023-80 |
Speech rate conversion (SRC) is a technology used to expand or contract the playback time without altering the pitch of ... [more] |
CNR2023-18 HCS2023-80 pp.62-67 |
WIT, SP, IPSJ-SLP [detail] |
2023-10-14 16:40 |
Fukuoka |
Kyushu Institute of Technology (Primary: On-site, Secondary: Online) |
Sequence-to-sequence Voice Conversion for Electrolaryngeal Speech Enhancement with Multi-stage Pretraining and Fine-tuning Techniques Ding Ma, Lester Phillip Violeta, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) SP2023-32 WIT2023-23 |
Sequence-to-sequence (seq2seq) voice conversion (VC) models have great potential for electrolaryngeal (EL) speech to nor... [more] |
SP2023-32 WIT2023-23 pp.27-32 |
WIT, SP, IPSJ-SLP [detail] |
2023-10-14 17:05 |
Fukuoka |
Kyushu Institute of Technology (Primary: On-site, Secondary: Online) |
Electrolaryngeal Speech Enhancement through Strong Linguistic Encoding Methods Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) SP2023-33 WIT2023-24 |
Although pretraining and fine-tuning approaches have proven to work well in speech intelligibility enhancement, various ... [more] |
SP2023-33 WIT2023-24 pp.33-38 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:20 |
Okinawa |
(Primary: On-site, Secondary: Online) |
An Investigation of Text-to-Speech Synthesis Using Voice Conversion and x-vector Embedding Sympathizing Emotion of Input Audio for Spoken Dialogue Systems Shunichi Kohara, Masanobu Abe, Sunao Hara (Okayama Univ.) EA2022-109 SIP2022-153 SP2022-73 |
In this paper, we propose a Text-to-Speech synthesis method to synthesize the same emotional expression as the input spe... [more] |
EA2022-109 SIP2022-153 SP2022-73 pp.203-208 |
EA, US (Joint) |
2022-12-22 16:50 |
Hiroshima |
Satellite Campus Hiroshima |
[Poster Presentation]
Data augmentation method for machine learning on speech data Tsubasa Maruyama (Tokyo Tech), Tsutomu Ikegami (AIST), Toshio Endo (Tokyo Tech), Takahiro Hirofuchi (AIST) EA2022-68 |
In machine learning, data augmentation is a method to enhance the number and diversity of data by adding transformations... [more] |
EA2022-68 pp.42-48 |
CCS |
2022-11-18 09:00 |
Mie |
(Primary: On-site, Secondary: Online) |
Voice Quality Conversion by Two-Step Process of Speech Feature Extraction and Speaker-Controlled Speech Synthesis Taichi Fukawa, Kenya Jin'no (Tokyo City Univ.) CCS2022-52 |
Many methods have been proposed in the field of voice quality conversion that use a style-transforming autoencoder. Howe... [more] |
CCS2022-52 pp.47-52 |
HCS |
2022-08-27 15:15 |
Hyogo |
(Primary: On-site, Secondary: Online) |
A Study of Feedback Methods for Speakers in Speech Rate Converted Conversation
-- Comparative evaluation for adaptive switching between audio feedback and visual feedback -- Kazuma Ban (Tokyo Denki Univ.), Hiroko Tokunaga (Tokyo Denki Univ./RIKEN), Naoki Mukawa, Hiroto Saito (Tokyo Denki Univ.) HCS2022-47 |
Speech rate conversion is a useful technique for people who need assistance in listening comprehension and non-native sp... [more] |
HCS2022-47 pp.61-66 |
SIP |
2022-08-26 14:08 |
Okinawa |
Nobumoto Ohama Memorial Hall (Ishigaki Island) (Primary: On-site, Secondary: Online) |
Study on Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients Quoc-Huy Nguyen, Masashi Unoki (JAIST) SIP2022-71 |
Bone-conducted (BC) speech potentially avoids the undesired effects on recorded speech due to background noise or reverb... [more] |
SIP2022-71 pp.109-114 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Study of End-to-End Text-to-Speech that can seamlessly control speaker's individuality by Manipulating Speaker features Naoki Aotani, Sunao Hara, Msanobu Abe (Okayama Univ) SP2022-14 |
In this paper, we investigate an End-to-End speech synthesis scheme that enables to seamlessly control speaker individua... [more] |
SP2022-14 pp.55-60 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 10:50 |
Online |
Online |
[Invited Talk]
Crazy vocoder is unbreakable
-- But let's talk about an informal vision of the future -- Masanori Morise (Meiji Univ.) SP2022-15 |
When current speech synthesis researchers refer to Vocoder in their papers, they are most likely referring to Neural voc... [more] |
SP2022-15 pp.61-66 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 13:00 |
Online |
Online |
[Poster Presentation]
Proposal of Speech Content Conversion and the Initial Trial: Conversion of Linguistic Information Depending on Situations Kohei Takita, Saizo Aoyagi, Tatsunori Hirai (Komazawa Univ.) SP2022-19 |
It is important to speak dialects, honorifics, and simple words for listeners and the environment in order to smooth com... [more] |
SP2022-19 pp.82-87 |
HCS |
2022-03-12 10:10 |
Online |
Online |
Evaluation of Feedback Methods for Speakers in Speech Rate Converted Conversation Tamami Mizuta, Hiroko Tokunaga, Naoki Mukawa, Hiroto Saito (Tokyo Denki Univ.) HCS2021-70 |
This study clarifies the characteristics of voice feedback and visual feedback, which are support functions for speakers ... [more] |
HCS2021-70 pp.55-60 |
EA, US (Joint) |
2021-12-22 13:30 |
Kumamoto |
Sojo University |
[Poster Presentation]
Improved voice quality due to multi-speaker learning with WaveNet vocoder Satoshi Yoshida, Shingo Uenohara, Ken'ichi Furuya (Oita Univ.) EA2021-57 |
In recent years, speech synthesis and voice quality conversion techniques using neural networks have attracted much atte... [more] |
EA2021-57 pp.1-6 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-03 10:30 |
Online |
Online |
An approach to voice conversion for manipulating emotion dimensions Keita Mukada, Hiroki Mori (Utsunomiya Univ.) NLC2021-25 SP2021-46 |
We propose an emotional voice conversion method based on the emotion dimensions. Conventional emotional voice conversion... [more] |
NLC2021-25 SP2021-46 pp.39-41 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Simulation of Body-conducted Speech and Synthesis of One's Own Voice with a Sound-proof Earmuff and Bone-conduction Microphones Chen Ruiyan, Nishimura Tazuko, Minematsu Nobuaki, Saito Daisuke (UTokyo) SP2021-15 |
When one hears his/her recorded voices for the first time, s/he is probably surprised and not rarely disappointed at the... [more] |
SP2021-15 pp.63-68 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Preliminary study on synthesizing relaxing voices
-- from a perspective of recognized/evoked emotions and acoustic features -- Yuki Watanabe, Shuichi Sakamoto (Tohoku Univ.), Takayuki Hoshi, Yoshiki Nagatani, Manabu Nakano (Pixie Dust Technologies) SP2021-19 |
The goal of this study is to synthesize speech sound which induces relaxed emotion. As the preliminary study, we investi... [more] |
SP2021-19 pp.85-90 |
HCGSYMPO (2nd) |
2020-12-15 - 2020-12-17 |
Online |
Online |
Effects on Speakers' Behaviors in Speech Rate Converted Conversation by Keeping Constant of Listening Speed for Hearer Hiroyuki Oba, Tamami Mizuta, Hiroko Tokunaga, Naoki Mukawa, Hiroto Saito (Tokyo Denki Univ.) |
In this study, we evaluate the effects on speakers’ behaviors in speech rate converted conversation that keeps the liste... [more] |
|