Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
PRMU, IBISML, IPSJ-CVIM [detail] |
2023-03-03 16:50 |
Hokkaido |
Future University Hakodate (Primary: On-site, Secondary: Online) |
Parallel-Data-Free Japanese Singer Conversion using CycleGAN Considering Perceptual Loss in Singing Phoneme Sequences Kanade Gemmoto, Nobutaka Shimada, Tadashi Matsuo (Ritsumeikan Univ) PRMU2022-114 IBISML2022-121 |
This paper proposes a one-to-one Japanese Singing Voice Conversion (SVC) method without using parallel data.
Our method... [more] |
PRMU2022-114 IBISML2022-121 pp.293-298 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 10:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Singing voice synthesis based on a frame-driven attention mechanism considering vocal timing deviation Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (NITech) EA2022-78 SIP2022-122 SP2022-42 |
This paper proposes singing voice synthesis (SVS) based on a frame-driven attention mechanism considering vocal timing d... [more] |
EA2022-78 SIP2022-122 SP2022-42 pp.19-24 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Diffusion-based parallel voice conversion with source-feature condition Takuya Kishida, Toru Nakashika (UEC) EA2022-107 SIP2022-151 SP2022-71 |
We propose a voice conversion method based on a diffusion probabilistic model trained on a parallel dataset. Since the d... [more] |
EA2022-107 SIP2022-151 SP2022-71 pp.191-196 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:20 |
Okinawa |
(Primary: On-site, Secondary: Online) |
An Investigation of Text-to-Speech Synthesis Using Voice Conversion and x-vector Embedding Sympathizing Emotion of Input Audio for Spoken Dialogue Systems Shunichi Kohara, Masanobu Abe, Sunao Hara (Okayama Univ.) EA2022-109 SIP2022-153 SP2022-73 |
In this paper, we propose a Text-to-Speech synthesis method to synthesize the same emotional expression as the input spe... [more] |
EA2022-109 SIP2022-153 SP2022-73 pp.203-208 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Choral Singing Voice Synthesis with Modulation Acoustic Features Sora Miyazawa, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-110 SIP2022-154 SP2022-74 |
In this paper, we analyzed the sense of multipule singing focused on unison and implemented it for a singing voice
synt... [more] |
EA2022-110 SIP2022-154 SP2022-74 pp.209-214 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 16:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Effects of Voice Artificiality on the Degree of Compatibility between Voice and Appearance of Voice Agents Kota Iura, Naotake Masuda, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-121 SIP2022-165 SP2022-85 |
For a spoken agent such as interactive robots, it is important to use a voice that fits the image of the agent in terms ... [more] |
EA2022-121 SIP2022-165 SP2022-85 pp.264-269 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 17:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Quantification of Voice Register Information including Mixed Voice based on Class Posterior Probabilities Yu Kitamura, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-122 SIP2022-166 SP2022-86 |
Methods to distinguish between modal and falsetto have been proposed so far,
but there are few studies analyzing mixed ... [more] |
EA2022-122 SIP2022-166 SP2022-86 pp.270-275 |
HIP, ITE-HI, VRPSY [detail] |
2023-02-22 13:20 |
Toyama |
(Primary: On-site, Secondary: Online) |
Is discomfort with one's own voice like an "Uncanny valley" phenomenon?
-- A study using voice morphing -- Hidekazu Nagamura, Seita Tomioka, Taichirou Tanaka, Kohta I. Kobayasi (Doshisha Univ.) HIP2022-79 |
Whereas our own voices are familiar to us in everyday life, we feel uncomfortable when listening to recordings of the vo... [more] |
HIP2022-79 pp.19-22 |
HCS |
2023-01-22 15:35 |
Kyoto |
Kyoto Institute of Technology (Primary: On-site, Secondary: Online) |
Nanto-Type Exclamatory Sentences
-- The Effect of the Presence or Absence of the Particle No on (Un)pleasantness Evaluation -- Akihiko Sakamoto (Tokyo Denki Univ.), Sachiko Takagi (Tokiwa Univ.) HCS2022-73 |
(To be available after the conference date) [more] |
HCS2022-73 pp.102-107 |
KBSE |
2023-01-19 14:00 |
Ishikawa |
(Primary: On-site, Secondary: Online) |
Using Speech Dialogue Agent to Extract User Needs for Service Personalization Takuya Nakata, Sinan Chen (Kobe Univ.), Sachio Saiki (Kochi Univ. of Tech.), Masahide Nakamura (Kobe Univ.) KBSE2022-45 |
In recent years, there has been a lot of research on service personalization. Although there are many prior studies base... [more] |
KBSE2022-45 pp.13-18 |
LOIS, ICM |
2023-01-19 17:15 |
Fukuoka |
Kitakyushu International Conference Center (Primary: On-site, Secondary: Online) |
Consider Crowdsourcing Support for Automated Minute Taking Shun Kuroiwa, Kazumu Nakahira, Takahiro Koita (Doshisha Univ.) ICM2022-40 LOIS2022-40 |
Minutes of meeting requires a huge amount of cost to record all the conversations in a meeting. In recent years,many res... [more] |
ICM2022-40 LOIS2022-40 pp.54-58 |
MBE, MICT, IEE-MBE [detail] |
2023-01-17 10:40 |
Saga |
|
Potential problems that will arise for hospital LANs Eisuke Hanada (Saga Univ.), Takato Kudou (Oita Univ.) MICT2022-46 MBE2022-46 |
Hospital Information Systems (HIS) have been introduced in almost all large hospitals. In addition to this, IP networks ... [more] |
MICT2022-46 MBE2022-46 pp.17-21 |
EA, US (Joint) |
2022-12-22 16:50 |
Hiroshima |
Satellite Campus Hiroshima |
[Poster Presentation]
Data augmentation method for machine learning on speech data Tsubasa Maruyama (Tokyo Tech), Tsutomu Ikegami (AIST), Toshio Endo (Tokyo Tech), Takahiro Hirofuchi (AIST) EA2022-68 |
In machine learning, data augmentation is a method to enhance the number and diversity of data by adding transformations... [more] |
EA2022-68 pp.42-48 |
HCGSYMPO (2nd) |
2022-12-14 - 2022-12-16 |
Kagawa |
Onsite (Sunport Takamatsu) and Online (Primary: On-site, Secondary: Online) |
Online Multi-party Voice Call Tool with Easy Switching Between Public and Private Conversations Naho Tomiki, Wataru Date, Ari Hautasaari, Takeshi Naemura (UTokyo) |
Online conferencing tools, both those with which the speaker's voice can be heard uniformly by all meeting participants ... [more] |
|
IN, IA (Joint) |
2022-12-13 11:05 |
Hiroshima |
Higashi-Senda campus, Hiroshima Univ. (Primary: On-site, Secondary: Online) |
Finite State VoIP Traffic Model for Variable Voice Payload Size Tomoyoshi Waki (Tokyo Tech), Sumiko Miyata (SIT), Yoshiaki Kitaguchi (Tokyo Tech), Ken-ichi Baba (KUTE-TOKYO), Katsunori Yamaoka (Tokyo Tech) IN2022-49 |
In emergencies, telephone calls provide a sense of security because one hears the other person's voice directly. Calls p... [more] |
IN2022-49 pp.32-38 |
CCS |
2022-11-18 09:00 |
Mie |
(Primary: On-site, Secondary: Online) |
Voice Quality Conversion by Two-Step Process of Speech Feature Extraction and Speaker-Controlled Speech Synthesis Taichi Fukawa, Kenya Jin'no (Tokyo City Univ.) CCS2022-52 |
Many methods have been proposed in the field of voice quality conversion that use a style-transforming autoencoder. Howe... [more] |
CCS2022-52 pp.47-52 |
PRMU |
2022-10-21 15:25 |
Tokyo |
Miraikan - The National Museum of Emerging Science and Innovation (Primary: On-site, Secondary: Online) |
Features and Deep Learning Models Suitable for Speech Source Discrimination Method in Plural Voice User Interfaces Environment Kengo Maeda, Takahiro Yoshida (TUS) PRMU2022-27 |
Under the situation that plural devices equipped with a voice user interface exist in the user’s environment in the near... [more] |
PRMU2022-27 pp.29-34 |
HCS |
2022-08-27 15:15 |
Hyogo |
(Primary: On-site, Secondary: Online) |
A Study of Feedback Methods for Speakers in Speech Rate Converted Conversation
-- Comparative evaluation for adaptive switching between audio feedback and visual feedback -- Kazuma Ban (Tokyo Denki Univ.), Hiroko Tokunaga (Tokyo Denki Univ./RIKEN), Naoki Mukawa, Hiroto Saito (Tokyo Denki Univ.) HCS2022-47 |
Speech rate conversion is a useful technique for people who need assistance in listening comprehension and non-native sp... [more] |
HCS2022-47 pp.61-66 |
NS, SR, RCS, SeMI, RCC (Joint) |
2022-07-13 14:50 |
Ishikawa |
The Kanazawa Theatre + Online (Primary: On-site, Secondary: Online) |
Investigation of noise removal using U-Net and voice recognition performance improvement
-- for train running noise -- Jian Lin, Shota Sano, Yuusuke Kawakita, Tsuyoshi Miyazaki, Hiroshi Tanaka (KAIT) SeMI2022-26 |
A method for converting noisy sound into images to remove the noise has been proposed. We are attempting to remove train... [more] |
SeMI2022-26 pp.34-39 |
ICM |
2022-07-07 13:25 |
Hokkaido |
Tokachi Plaza (Primary: On-site, Secondary: Online) |
A study on Voice Quality Deterioration Monitoring Using OSS for Service Monitoring Yasuhiro Onozuka, Akihiro Shibata, Yoshitaka Syuntou, Kozo Sakae (DOCOMO Technology) ICM2022-11 |
In recent years, stable voice calling services have been demanded to the increased use of calls by corporate users. Ther... [more] |
ICM2022-11 pp.7-10 |