Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SIS |
2024-03-14 13:00 |
Kanagawa |
Kanagawa Institute of Technology (Primary: On-site, Secondary: Online) |
On Time-Position Detection of Signals under Noise Considering Threshold
-- Applications of Fractal Dimension Filters -- Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-45 |
Conflicts due to neighborhood noise can occur even when the sound pressure level is low. In such cases, the sound pressu... [more] |
SIS2023-45 pp.1-6 |
SIS |
2024-03-14 14:00 |
Kanagawa |
Kanagawa Institute of Technology (Primary: On-site, Secondary: Online) |
Consideration on divisions and combinations of Learning Data for Speaker Diarization in Multiple Speakers Kaito Uemura, Keiichi Horio (Kyushu Institute of Technology) SIS2023-48 |
Today, the importance of a speech segment detection technique called speaker diarization is increasing, mainly in the fi... [more] |
SIS2023-48 pp.17-20 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Noise-Robust Voice Conversion by Denoising Training Conditioned with Latent Variables of Speech Quality and Recording Environment Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi (UT), Ryuichi Yamamoto, Kentaro Tachibana (LY), Hiroshi Saruwatari (UT) EA2023-63 SIP2023-110 SP2023-45 |
In this paper, we propose noise-robust voice conversion by conditioning latent variables representing speech quality and... [more] |
EA2023-63 SIP2023-110 SP2023-45 pp.13-18 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 16:25 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Development of the mental disorder estimation model using voice Kaho Kato, Akihiko Takashima, Kei Kikuiri, Takeshi Yoshimura (NTT docomo) EA2023-74 SIP2023-121 SP2023-56 |
It is known that early dealing with psychological stress prevents the mental disorder such as depression from developing... [more] |
EA2023-74 SIP2023-121 SP2023-56 pp.79-84 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 15:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Prediction of Voice Processing Intensity Matching the Impression of a Voice Agent Ren Miyamoto, Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) EA2023-132 SIP2023-179 SP2023-114 |
When a voice agent such as a robot interacts with a human, it is important in terms of familiarity with the agent to sel... [more] |
EA2023-132 SIP2023-179 SP2023-114 pp.415-420 |
CNR, BioX |
2024-03-01 09:30 |
Tokyo |
NHK Science & Technology Research Laboratories (Primary: On-site, Secondary: Online) |
Respiration-enhanced Human-Robot Interaction Takao Obi, Kotaro Funakoshi (Tokyo Tech.) BioX2023-75 CNR2023-42 |
In the field of Human-Robot Interaction (HRI), enhancing a robot's impression, affinity, and interaction smoothness is c... [more] |
BioX2023-75 CNR2023-42 pp.30-34 |
CNR |
2024-01-19 14:00 |
Kagoshima |
Fureai Plaza Nanohana-Kan Meeting Room 1 (IBUSUKI City) (Primary: On-site, Secondary: Online) |
Implementation and Evaluation of a Speech System Synchronized to Personal Tempo Yosuke Ujigawa, Kazunori Takashio (Keio University) CNR2023-30 |
In everyday life, people maintain their own unique tempo, known as Personal Tempo. Tempo is also highly important in dia... [more] |
CNR2023-30 pp.25-30 |
HCGSYMPO (2nd) |
2023-12-11 - 2023-12-13 |
Fukuoka |
Asia pacific Import Mart (Kitakyushu) (Primary: On-site, Secondary: Online) |
Turn-Taking Prediction Model Using Single Speaker Features Kazuyo Onishi, Hiroki Tanaka, Satoshi Nakamura (NAIST) |
Prediction of utterances in two-party conversations is important to make turn-taking between humans and virtual agents n... [more] |
|
SIS |
2023-12-08 09:50 |
Aichi |
Sakurayama Campus, Nagoya City University (Primary: On-site, Secondary: Online) |
Time-position Detection of Signal under Background Noise Using Fractal Dimensional Filter Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-34 |
Conflicts due to neighborhood noise occur even when noise levels are lower than those specified by environmental standar... [more] |
SIS2023-34 pp.55-60 |
WIT, SP, IPSJ-SLP [detail] |
2023-10-14 16:40 |
Fukuoka |
Kyushu Institute of Technology (Primary: On-site, Secondary: Online) |
Sequence-to-sequence Voice Conversion for Electrolaryngeal Speech Enhancement with Multi-stage Pretraining and Fine-tuning Techniques Ding Ma, Lester Phillip Violeta, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) SP2023-32 WIT2023-23 |
Sequence-to-sequence (seq2seq) voice conversion (VC) models have great potential for electrolaryngeal (EL) speech to nor... [more] |
SP2023-32 WIT2023-23 pp.27-32 |
SIS, ITE-BCT |
2023-10-13 11:05 |
Yamaguchi |
HISTORIA UBE (Primary: On-site, Secondary: Online) |
Proposal of a fractal dimensional filter and its application to the detection of Schlegel's green frog voices. Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-22 |
We propose a fractal dimensional filter to estimate time-position of the target sound from time series data. The filter ... [more] |
SIS2023-22 pp.35-40 |
MIKA (3rd) |
2023-10-11 14:30 |
Okinawa |
Okinawa Jichikaikan (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Voice Recognition AR System Using Edge Computing Taito Baba, Ryo Midorikawa, Takumi Senaha, Toma Uruizaka, Takuya Asaka (Tokyo Metropolitan Univ.) |
People with unilateral hearing loss struggle with sound localization and voice recognition. Additionally, users of noise... [more] |
|
AI |
2023-09-12 15:55 |
Hokkaido |
|
Estimation of unmasked face images based on voice and 3DMM Tetsumaru Akatsuka, Ryohei Orihara, Yuichi Sei, Yasuyuki Tahara, Akihiko Ohsuga (UEC) AI2023-32 |
Facemasks have become common due to the COVID-19 pandemic. They have begun to affect security and identification systems... [more] |
AI2023-32 pp.187-193 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Impression Conversion of Speech for Unknown Speakers Using FaderNet Saki Kugimoto, Toru Nakashika (UEC) SP2023-2 |
This paper proposes a model that can convert impressions of unknown speakers who do not have impression labels, based on... [more] |
SP2023-2 pp.4-7 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Data Augmentation by Synthesised Voice for Deep Learning-based A Cappella Separation Kyoka Kazama (TMU), Yuma Kinoshita (Tokai Univ.), Natsuki Ueno, Nobutaka Ono (TMU) SP2023-4 |
In this study, we examine efficacy of training data augmentation for a cappella singing voice separation using deep lear... [more] |
SP2023-4 pp.14-19 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
Opera-singing voice synthesis using Diff-SVC Aoto Sugahara (Kobe Univ.), Soma Kishimoto, Yuji Adachi, Kiyoto Tai (MEC Company Ltd.), Ryoichi Takashima, Testuya Takiguchi (Kobe Univ.) SP2023-7 |
Singing voice synthesis technology is widely used in the entertainment field, it has attracted attention as a method to ... [more] |
SP2023-7 pp.30-35 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
Parody Detection Based on Alignment Collapse Between Lyrics and Singing Voice Tomoki Ariga, Yosuke Higuchi (Waseda Univ.), Mitsunori Kanno, Rie Shigyo, Takato Mizuguchi, Naoki Okamoto (DAIICHIKOSHO), Tetsuji Ogawa (Waseda Univ.) SP2023-10 |
We propose a parody detection system for karaoke singing by evaluating alignment collapse between lyrics and singing voi... [more] |
SP2023-10 pp.48-53 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
The effect of acoustic and linguistic information on the evaluation of one's own recorded speech Hidekazu Nagamura, Seita Tomioka, Taichirou Tanaka, Kohta I. Kobayasi (Doshisha Univ.) SP2023-13 |
Despite the fact that people usually hear their voice when they speak, they feel uncomfortable when listening to a recor... [more] |
SP2023-13 pp.65-67 |
NLP, MSS |
2023-03-17 11:00 |
Nagasaki |
(Primary: On-site, Secondary: Online) |
Knowledge Extraction by Machine Learning Using Physical and Human Sensors in Smart Agriculture Kenta Toya, Moritaro Inoue, Naoshi Uchihira (JAIST) MSS2022-97 NLP2022-142 |
Smart agriculture based on IoT is one of the effective methods to improve the efficiency of agricultural work. However, ... [more] |
MSS2022-97 NLP2022-142 pp.164-167 |
PRMU, IBISML, IPSJ-CVIM [detail] |
2023-03-03 16:50 |
Hokkaido |
Future University Hakodate (Primary: On-site, Secondary: Online) |
Parallel-Data-Free Japanese Singer Conversion using CycleGAN Considering Perceptual Loss in Singing Phoneme Sequences Kanade Gemmoto, Nobutaka Shimada, Tadashi Matsuo (Ritsumeikan Univ) PRMU2022-114 IBISML2022-121 |
This paper proposes a one-to-one Japanese Singing Voice Conversion (SVC) method without using parallel data.
Our method... [more] |
PRMU2022-114 IBISML2022-121 pp.293-298 |