Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA |
2024-05-22 14:15 |
Online |
Online |
未定
-- 未定 -- Tsubasa Ochiai (NTT), Kazuma Iwamoto (Doshisha Univ.), Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki (NTT), Shigeru Katagiri (Doshisha Univ.) |
(To be available after the conference date) [more] |
|
EA |
2024-05-22 16:50 |
Online |
Online |
[Invited Talk]
Fundamentals of Diffusion-based Generative Models and their Application to Speech Enhancement and Separation Scheibler Robin (LY Corp.) |
(To be available after the conference date) [more] |
|
SIS |
2024-03-14 13:00 |
Kanagawa |
Kanagawa Institute of Technology (Primary: On-site, Secondary: Online) |
On Time-Position Detection of Signals under Noise Considering Threshold
-- Applications of Fractal Dimension Filters -- Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-45 |
Conflicts due to neighborhood noise can occur even when the sound pressure level is low. In such cases, the sound pressu... [more] |
SIS2023-45 pp.1-6 |
CAS, CS |
2024-03-14 13:30 |
Okinawa |
|
Characterization of Semantic Communications in Speech Signal Transmission Futo Iwanaga, Daisuke Umehara (Kyoto Inst. of Tech.) CAS2023-118 CS2023-111 |
In recent years, the volume of data in data communication has surged, Characterization of Semantic Communications in Spe... [more] |
CAS2023-118 CS2023-111 pp.41-46 |
CAS, CS |
2024-03-14 15:55 |
Okinawa |
|
Residual Noise Removal in of Sound Source Separation Signal by Spectral Replacement Taiga Saito, Kenji Suyama (Tokyo Denki Univ.) CAS2023-122 CS2023-115 |
Although sound source separation method based on a multiplication of multiple weighted sum circuits has high suppression... [more] |
CAS2023-122 CS2023-115 pp.64-69 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 16:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Multiple Lag Window Pairs for Estimation of Fundamental Frequency and Periodicity Measure Michiki Koshimori (UEC), Shigeki Sagayama (UTokyo/UEC), Toru Nakashika (UEC) EA2023-75 SIP2023-122 SP2023-57 |
Extending the main concept of modified autocorrelation method in LPC, we investigate lag windows, lag window pairs, and ... [more] |
EA2023-75 SIP2023-122 SP2023-57 pp.85-90 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
An Investigation on the Speech Recovery from EEG Signals Using Transformer Tomoaki Mizuno (The Univ. of Electro-Communications), Takuya Kishida (Aichi Shukutoku Univ.), Natsue Yoshimura (Tokyo Tech), Toru Nakashika (The Univ. of Electro-Communications) EA2023-108 SIP2023-155 SP2023-90 |
Synthesizing full speech from ElectroEncephaloGraphy(EEG) signals is a challenging task. In this paper, speech reconstru... [more] |
EA2023-108 SIP2023-155 SP2023-90 pp.277-282 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 15:25 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Investigation of objective intelligibility metrics based on speech foundation models for Clarity Prediction Challenge 2 Katsuhiko Yamamoto (CyberAgent) EA2023-119 SIP2023-166 SP2023-101 |
Speech Foundation Models (SFMs), which use components like the encoder layer of Whisper, have been suggested to separate... [more] |
EA2023-119 SIP2023-166 SP2023-101 pp.334-339 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 16:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluations of Multi-channel Blind Source Separation for Speech Recognition in Car Environments Yutsuki Takeuchi, Natsuki Ueno, Nobutaka Ono (Tokyo Metropolitan Univ.), Takashi Takazawa, Shuhei Shimanoe, Tomoki Tanemura (MIRISE Technologies) EA2023-127 SIP2023-174 SP2023-109 |
In car environments, speech recognition is difficult due to various types of noise. For this issue, speech enhancement b... [more] |
EA2023-127 SIP2023-174 SP2023-109 pp.388-393 |
SIS |
2023-12-08 09:50 |
Aichi |
Sakurayama Campus, Nagoya City University (Primary: On-site, Secondary: Online) |
Time-position Detection of Signal under Background Noise Using Fractal Dimensional Filter Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-34 |
Conflicts due to neighborhood noise occur even when noise levels are lower than those specified by environmental standar... [more] |
SIS2023-34 pp.55-60 |
EA, ASJ-H, ASJ-MA, ASJ-SP |
2023-07-02 15:10 |
Hokkaido |
|
Speech Restoration of Spectrogram Images Printed in a Document "Visible Speech" Published in 1947 Naofumi Aoki (Hokkaido Univ.) EA2023-6 |
The restoration of speech materials recorded in the past might be regarded as a study in acoustical archeology. It may p... [more] |
EA2023-6 pp.12-15 |
HIP, HCS, HI-SIGCOASTER [detail] |
2023-05-15 10:20 |
Okinawa |
Okinawa Industry Support Center (Primary: On-site, Secondary: Online) |
Cognitive Load Estimation of Speech-in-Noise Recall Task with State-Space Models Mateusz Dubiel (uni.lu), Minoru Nakayama (Tokyo Tech.), Xin Wang (NII) HCS2023-7 HIP2023-7 |
Cognitive workload during a listening and recall task was estimated using a state-space model based on metrics of pupill... [more] |
HCS2023-7 HIP2023-7 pp.29-32 |
PRMU, IBISML, IPSJ-CVIM [detail] |
2023-03-02 15:10 |
Hokkaido |
Future University Hakodate (Primary: On-site, Secondary: Online) |
[Invited Talk]
-- Yuma Koizumi (Google Research) PRMU2022-87 IBISML2022-94 |
Machine learning tasks that deal with acoustic signals can be broadly classified into "recognizing sounds" and "generati... [more] |
PRMU2022-87 IBISML2022-94 p.149 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 13:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
[Invited Talk]
Multiple sound spot synthesis meets multilingual speech synthesis
-- Implementation is really all we need -- Takuma Okamoto (NICT) EA2022-87 SIP2022-131 SP2022-51 |
A multilingual multiple sound spot synthesis system is implemented as a user interface for real-time speech translation ... [more] |
EA2022-87 SIP2022-131 SP2022-51 pp.73-76 |
EMM |
2023-01-26 14:00 |
Miyagi |
Tohoku Univ. (Primary: On-site, Secondary: Online) |
Improving Frame Synchronization in Blind Speech Watermarking Method based on Spread-Spectrum using Linear Prediction Residue Takuto Isoyama (JAIST), Tetsuya Kojima (NIT, Tokyo College), Masashi Unoki (JAIST) EMM2022-66 |
We previously proposed a blindly-detectable direct-spread spectrum (DSS) method using linear prediction (LP) residue. Th... [more] |
EMM2022-66 pp.26-31 |
HCGSYMPO (2nd) |
2022-12-14 - 2022-12-16 |
Kagawa |
Onsite (Sunport Takamatsu) and Online (Primary: On-site, Secondary: Online) |
Modelling cognitive load with ocular responses during a noisy synthetic speech recall task Mateusz Dubiel (uni.lu), Minoru Nakayama (Tokyo Tech.), Xin Wang (NII) |
We applied state-space models to estimate the cognitive workload based
on participants' reactions to speech signals (i... [more] |
|
EA, EMM, ASJ-H |
2022-11-22 13:00 |
Online |
Online |
[Fellow Memorial Lecture]
Security and Privacy Preservation for Speech Signal
-- Approach from speech information hiding technology -- Masashi Unoki (JAIST) EA2022-60 EMM2022-60 |
Non-authentic but skillfully fabricated artificial replicas of authentic media in the real world are known as “media clo... [more] |
EA2022-60 EMM2022-60 pp.99-104 |
SIS, ITE-BCT |
2022-10-13 14:15 |
Aomori |
Hachinohe Institute of Technology (Primary: On-site, Secondary: Online) |
Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks Reito Kasuga, Tetsuya Shimamura, Yosuke Sugiura, Nozomiko Yasui (Saitama Univ.) SIS2022-12 |
Although the field of speech enhancement has been extensively studied around the world, phase tends to be neglected comp... [more] |
SIS2022-12 pp.7-12 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 15:00 |
Online |
Online |
Unsupervised Training of Sequential Neural Beamformer Using Blindly-separated and Non-separated Signals Kohei Saijo, Tetsuji Ogawa (Waseda Univ.) SP2022-25 |
We present an unsupervised training method of the sequential neural beamformer (Seq-NBF) using the separated signals fro... [more] |
SP2022-25 pp.110-115 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-01 14:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Target speaker extraction based on conditional variational autoencoder and directional information in underdetermined condition Rui Wang, Li Li, Tomoki Toda (Nagoya Univ) EA2021-76 SIP2021-103 SP2021-61 |
This paper deals with a dual-channel target speaker extraction problem in underdetermined conditions. A blind source sep... [more] |
EA2021-76 SIP2021-103 SP2021-61 pp.76-81 |