Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA |
2024-05-22 14:15 |
Online |
Online |
未定
-- 未定 -- Tsubasa Ochiai (NTT), Kazuma Iwamoto (Doshisha Univ.), Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki (NTT), Shigeru Katagiri (Doshisha Univ.) |
(To be available after the conference date) [more] |
|
EA |
2024-05-22 16:50 |
Online |
Online |
[Invited Talk]
Fundamentals of Diffusion-based Generative Models and their Application to Speech Enhancement and Separation Scheibler Robin (LY Corp.) |
(To be available after the conference date) [more] |
|
SIS |
2024-03-14 13:00 |
Kanagawa |
Kanagawa Institute of Technology (Primary: On-site, Secondary: Online) |
On Time-Position Detection of Signals under Noise Considering Threshold
-- Applications of Fractal Dimension Filters -- Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-45 |
Conflicts due to neighborhood noise can occur even when the sound pressure level is low. In such cases, the sound pressu... [more] |
SIS2023-45 pp.1-6 |
CAS, CS |
2024-03-14 15:55 |
Okinawa |
|
Residual Noise Removal in of Sound Source Separation Signal by Spectral Replacement Taiga Saito, Kenji Suyama (Tokyo Denki Univ.) CAS2023-122 CS2023-115 |
Although sound source separation method based on a multiplication of multiple weighted sum circuits has high suppression... [more] |
CAS2023-122 CS2023-115 pp.64-69 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
|
We have developed automatic speech recognition and dialect identification techniques by using COJADS, a corpus of Japane... [more] |
|
SIS |
2023-12-08 09:50 |
Aichi |
Sakurayama Campus, Nagoya City University (Primary: On-site, Secondary: Online) |
Time-position Detection of Signal under Background Noise Using Fractal Dimensional Filter Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-34 |
Conflicts due to neighborhood noise occur even when noise levels are lower than those specified by environmental standar... [more] |
SIS2023-34 pp.55-60 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 09:30 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online) |
Enhancing Recognition of Rare Words in ASR through Error Detection and Context-Aware Error Correction Jiajun He, Zekun Yang, Tomoki Toda (Nagoya Univ.) NLC2023-16 SP2023-36 |
Automatic speech recognition (ASR) systems often suffer from errors, particularly when recognizing rare words. These err... [more] |
NLC2023-16 SP2023-36 pp.13-18 |
ET |
2023-10-21 15:30 |
Nagano |
Shinshu University Faculty of Engineering |
"Listening" Performance of Generative AI and Elementary Foreign Language Learners in Code-Switching Discourse Sunaoka Kazuko (Waseda Univ.), Qin Xu (Kyoto Univ.) ET2023-23 |
We used the Whisper model to automatically recognize and process teachers' Japanese and Chinese code-switching (CS) in a... [more] |
ET2023-23 pp.33-37 |
EA, ASJ-H, ASJ-MA, ASJ-SP |
2023-07-02 15:10 |
Hokkaido |
|
Speech Restoration of Spectrogram Images Printed in a Document "Visible Speech" Published in 1947 Naofumi Aoki (Hokkaido Univ.) EA2023-6 |
The restoration of speech materials recorded in the past might be regarded as a study in acoustical archeology. It may p... [more] |
EA2023-6 pp.12-15 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Streaming End-to-End speech recognition using a CTC decoder with substituted linguistic information Tatsunari Takagi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka, Yukoh Wakabayashi (TUT) SP2023-12 |
Speech recognition technology has been employed in various fields due to the enhancement of speech recognition model acc... [more] |
SP2023-12 pp.60-64 |
HIP, HCS, HI-SIGCOASTER [detail] |
2023-05-15 10:20 |
Okinawa |
Okinawa Industry Support Center (Primary: On-site, Secondary: Online) |
Cognitive Load Estimation of Speech-in-Noise Recall Task with State-Space Models Mateusz Dubiel (uni.lu), Minoru Nakayama (Tokyo Tech.), Xin Wang (NII) HCS2023-7 HIP2023-7 |
Cognitive workload during a listening and recall task was estimated using a state-space model based on metrics of pupill... [more] |
HCS2023-7 HIP2023-7 pp.29-32 |
ICD |
2023-04-10 13:20 |
Kanagawa |
(Primary: On-site, Secondary: Online) |
[Invited Talk]
Novel scheme of HZO/Si FeFET reservoir computing for speech recognition Eishin Nako, Kasidit Toprasertpong, Ryosho Nakane, Mitsuru Takenaka, Shinichi Takagi (The Univ. of Tokyo) ICD2023-4 |
We have demonstrated reservoir computing (RC) using HZO/Si ferroelectric gate FETs (FeFETs), which realizes efficient ti... [more] |
ICD2023-4 p.9 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 13:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
[Invited Talk]
Speech and Language Research in the Google Tokyo Office Michiel Bacchiani (Google) EA2022-116 SIP2022-160 SP2022-80 |
This talk will consist of three parts. In the first part of the talk, I will reflect on some lessons learned from the ac... [more] |
EA2022-116 SIP2022-160 SP2022-80 pp.239-240 |
HCGSYMPO (2nd) |
2022-12-14 - 2022-12-16 |
Kagawa |
Onsite (Sunport Takamatsu) and Online (Primary: On-site, Secondary: Online) |
Modelling cognitive load with ocular responses during a noisy synthetic speech recall task Mateusz Dubiel (uni.lu), Minoru Nakayama (Tokyo Tech.), Xin Wang (NII) |
We applied state-space models to estimate the cognitive workload based
on participants' reactions to speech signals (i... [more] |
|
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-12-01 14:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
A Japanese Automatic Speech Recognition System on the Next-Gen Kaldi Framework Wen Shen Teo, Yasuhiro Minami (UEC) NLC2022-16 SP2022-36 |
2021 saw the introduction of the cutting-edge successor to the Kaldi speech processing toolkit, known as Next-Gen Kaldi.... [more] |
NLC2022-16 SP2022-36 pp.39-44 |
EA, EMM, ASJ-H |
2022-11-22 13:00 |
Online |
Online |
[Fellow Memorial Lecture]
Security and Privacy Preservation for Speech Signal
-- Approach from speech information hiding technology -- Masashi Unoki (JAIST) EA2022-60 EMM2022-60 |
Non-authentic but skillfully fabricated artificial replicas of authentic media in the real world are known as “media clo... [more] |
EA2022-60 EMM2022-60 pp.99-104 |
SIS, ITE-BCT |
2022-10-13 14:15 |
Aomori |
Hachinohe Institute of Technology (Primary: On-site, Secondary: Online) |
Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks Reito Kasuga, Tetsuya Shimamura, Yosuke Sugiura, Nozomiko Yasui (Saitama Univ.) SIS2022-12 |
Although the field of speech enhancement has been extensively studied around the world, phase tends to be neglected comp... [more] |
SIS2022-12 pp.7-12 |
SIP, BioX, IE, MI, ITE-IST, ITE-ME [detail] |
2022-05-20 11:30 |
Kumamoto |
Kumamoto University Kurokami Campus (Primary: On-site, Secondary: Online) |
Implementation of a Lightweight Automatic Speech Recognition System at the Edge Haotian Tan, Junichi Akita (Kanazawa Univ.) |
Automatic speech recognition (ASR) on the cloud has been widely adopted and has demonstrated satisfactory performance. W... [more] |
|
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-01 13:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
The upper limit of subjective intelligibility score of speech enhancement using IRM
-- comparison between laboratory and crowdsourcing experiments -- Ayako Yamamoto, Toshio Irino (Wakayama Univ.), Shoko Araki, Kenichi Arai, Atsunori Ogawa, Keisuke Kinoshita, Tomohiro Nakatani (NTT) EA2021-74 SIP2021-101 SP2021-59 |
We performed subjective speech intelligibility experiments in a laboratory and using crowdsourcing to get a fundamental ... [more] |
EA2021-74 SIP2021-101 SP2021-59 pp.64-69 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-01 14:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Target speaker extraction based on conditional variational autoencoder and directional information in underdetermined condition Rui Wang, Li Li, Tomoki Toda (Nagoya Univ) EA2021-76 SIP2021-103 SP2021-61 |
This paper deals with a dual-channel target speaker extraction problem in underdetermined conditions. A blind source sep... [more] |
EA2021-76 SIP2021-103 SP2021-61 pp.76-81 |