Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Vocal tract length perturbation-based pseudo-speaker augmentation for automatic speaker verification Tomoka Wakamatsu, Sayaka Shiota, Hitoshi Kiya (Tokyo Metropolitan Univ.) EA2023-61 SIP2023-108 SP2023-43 |
In recent years, deep neural network (DNN)-based automatic speaker verification (ASV) systems have become mainstream. Da... [more] |
EA2023-61 SIP2023-108 SP2023-43 pp.1-6 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 09:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Pseudo-speaker augmentation based on vocal tract length perturbation considering speaker variability for speaker verification Fumika Ono, Tomoka Wakamatsu, Sayaka Shiota (TMU) EA2023-62 SIP2023-109 SP2023-44 |
In order to construct a reliable speaker verification system based on speaker embeddings, it is necessary to train the s... [more] |
EA2023-62 SIP2023-109 SP2023-44 pp.7-12 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Noise-Robust Voice Conversion by Denoising Training Conditioned with Latent Variables of Speech Quality and Recording Environment Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi (UT), Ryuichi Yamamoto, Kentaro Tachibana (LY), Hiroshi Saruwatari (UT) EA2023-63 SIP2023-110 SP2023-45 |
In this paper, we propose noise-robust voice conversion by conditioning latent variables representing speech quality and... [more] |
EA2023-63 SIP2023-110 SP2023-45 pp.13-18 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Multi-task learning with age information model for highly accurate elderly speech recognition. Shine Takumi, Kinouchi Takahiro, Wakabayashi Yukoh, Kitaoka Norihide (TUT) EA2023-64 SIP2023-111 SP2023-46 |
The speech recognition of the elderly is less accurate, especially in smart speaker speech recognition, due to aging-rel... [more] |
EA2023-64 SIP2023-111 SP2023-46 pp.19-24 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Simultaneous Estimation of Transfer Coefficients and Signals of Sound-to-Light Conversion Device Blinky Under Saturation Using Non-negative Matrix Factorization Kosuke Nishida, Natsuki Ueno, Nobutaka Ono (TMU), Daichi Kitamura (Kagawa NCT) EA2023-65 SIP2023-112 SP2023-47 |
In this study, we propose a method to estimate the intensity of optical signals emitted by sound-to-optical conversion d... [more] |
EA2023-65 SIP2023-112 SP2023-47 pp.25-30 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 09:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Derivation of Direct Update Rule for Back-Projected Separation Matrix Yui Kuriki, Taishi Nakashima, Nobutaka Ono (TMU) EA2023-66 SIP2023-113 SP2023-48 |
Blind source separation (BSS) is a widely used technique for separating mixed signals originating from multiple sources.... [more] |
EA2023-66 SIP2023-113 SP2023-48 pp.31-36 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Analysis of Overlapped Utterances in Everyday Conversation and Source Separation by Online Independent Vector Analysis for Asynchronous Distributed Recordings Haruki Nammoku, Taishi Nakashima, Kouei Yamaoka, Yukoh Wakabayashi, Nobutaka Ono (TMU) EA2023-67 SIP2023-114 SP2023-49 |
In this study, we investigate the effects of overlapped utterances on transcription in everyday conversation and propose... [more] |
EA2023-67 SIP2023-114 SP2023-49 pp.37-42 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Accelerating and stabilizing vectorwise coordinate descent for spatially regularized independent low-rank matrix analysis Yuto Ishikawa, Takuya Okubo, Norihiro Takamune (UTokyo), Tomohiko Nakamura (AIST), Daichi Kitamura (NIT Kagawa), Hiroshi Saruwatari (UTokyo), Yu Takahashi, Kazunobu Kondo (Yamaha) EA2023-68 SIP2023-115 SP2023-50 |
Spatially regularized independent low-rank matrix analysis (SR-ILRMA) is the method that introduces the spatial prior in... [more] |
EA2023-68 SIP2023-115 SP2023-50 pp.43-50 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 11:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of Effect of Scatterer Shape on Incident Sound Field Estimation Based on Kernel Interpolation Shihori Kozuka (NTT), Shoichi Koyama (NII), Hiroaki Itou, Noriyoshi Kamado (NTT) EA2023-69 SIP2023-116 SP2023-51 |
Techniques for estimating the incident sound field using multiple microphones are effective for spatial sound field cont... [more] |
EA2023-69 SIP2023-116 SP2023-51 pp.51-56 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 11:20 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Study on Virtual Sensing Feedback ANC System with Noise Control Filter Selection Shota Toyooka, Yoshinobu Kajikawa (Kansai Univ.) EA2023-70 SIP2023-117 SP2023-52 |
This paper proposes a virtual sensing feedback ANC system based on noise control filter selection.
The proposed ANC sys... [more] |
EA2023-70 SIP2023-117 SP2023-52 pp.57-60 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 11:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
EA2023-71 SIP2023-118 SP2023-53 |
(To be available after the conference date) [more] |
EA2023-71 SIP2023-118 SP2023-53 pp.61-64 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 12:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
On conditions for stably working filtered-x type active noise control systems Kensaku Fujii (Kodaway Lab.), Mitsuji Muneyasu (Kansai Univ.), Yoshifumi Chisaki (CIT) EA2023-72 SIP2023-119 SP2023-54 |
[more] |
EA2023-72 SIP2023-119 SP2023-54 pp.65-72 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 13:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
[Invited Talk]
Making the Invisible Visible: Toward High-Quality Deep THz Computational Imaging Chia-Wen Lin (National Tsing Hua Univ.) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Computational Complexity Reduction for Clustering in Speaker Diarization Komei Yamashita, Ryota Shimokura, Youji Iiguni (Osaka Univ.) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:15 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Selective Active Noise Control using Cartilage Conduction as a Secondary Source
-- Canceling complex and narrowband noise by Delayed-X Harmonics Synthesizer Algorithm -- Miyuki Azuma, Ryota Shimokura, Yoji Iiguni (Osaka Univ.) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:20 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Application of Audio Adversarial Examples to Audio CAPTCHA Yusuke Nobukawa, Ryota Shimokura, Yoji Iiguni (Osaka Univ.) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:25 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of the validity of CNN-based image quality assessment Ririko Harada (Osaka Univ.), Ryo Hayakawa (TUAT), Youji Iiguni (Osaka Univ.) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Adaptation of End-to-End Japanese Speech Synthesis Using Crowdsoursed Dialect Accent Labels Yuki Oda, Kazuki Yamauchi, Yuki Saito, Hiroshi Saruwatari (UTokyo) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
SRC4VC: Smartphone-Recorded Corpus for Benchmarking Multi-Speaker Voice Conversion Models Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi (UT), Ryuichi Yamamoto, Kentaro Tachibana (LY), Hiroshi Saruwatari (UT) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Preliminary Evaluation of Japanese Speech Corpus J-SpAW for Speaker Verification and Spoofing Detection Kota Kanno (Tokyo Metropolitan Univ.), Shinnosuke Takamichi (UTokyo), Sayaka Shiota (Tokyo Metropolitan Univ.) |
[more] |
|