Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA |
2024-05-22 16:50 |
Online |
Online |
[Invited Talk]
Fundamentals of Diffusion-based Generative Models and their Application to Speech Enhancement and Separation Scheibler Robin (LY Corp.) |
(To be available after the conference date) [more] |
|
SIS |
2024-03-14 13:00 |
Kanagawa |
Kanagawa Institute of Technology (Primary: On-site, Secondary: Online) |
On Time-Position Detection of Signals under Noise Considering Threshold
-- Applications of Fractal Dimension Filters -- Hideo Shibayama (Shibaura Institute of Technology), Yoshiaki Makabe (Kanagawa Institute of Technology), Kenji Muto (Shibaura Institute of Technology), Tomoaki Kimura (Kanagawa Institute of Technology) SIS2023-45 |
Conflicts due to neighborhood noise can occur even when the sound pressure level is low. In such cases, the sound pressu... [more] |
SIS2023-45 pp.1-6 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Constructing and Evaluating a Batch Voice Input System for Electronic Medical Records Using Large Language Models Ryo Maejima, Norihide Kitaoka (TUT) EA2023-99 SIP2023-146 SP2023-81 |
This study aims to develop an electronic medical record with a voice input interface that lets users input several items... [more] |
EA2023-99 SIP2023-146 SP2023-81 pp.226-231 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
An Investigation into Weighting Strategies for Model Averaging in Continual Learning for Automatic Speech Recognition Kentaro Shinayama, Hiroshi Sato, Tomoharu Iwata, Takeshi Mori, Taichi Asami (NTT) EA2023-105 SIP2023-152 SP2023-87 |
In recent years, the application scope of speech recognition AI has expanded, enabling the acquisition of diverse data d... [more] |
EA2023-105 SIP2023-152 SP2023-87 pp.262-267 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Substitution of Implicit Linguistic Information in Beam Search Decoding Using CTC-based Speech Recognition Models Tatsunari Takagi, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-106 SIP2023-153 SP2023-88 |
The rise of neural networks in the field of automatic speech recognition has notably improved the accuracy of speech rec... [more] |
EA2023-106 SIP2023-153 SP2023-88 pp.268-273 |
EMM, EA, ASJ-H |
2023-11-23 15:45 |
Toyama |
|
[Invited Talk]
Auditory representation effective for extracting speech information: Theory, measurement, estimation, and applications Toshio Irino (Wakayama Univ.) |
Just by listening to the voice on a telephone, we can immediately tell whether the caller is an adult or a child, and we... [more] |
EA2023-46 EMM2023-77 pp.98-103 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Feature Representation of Japanese Pitch Accent and its Perceptual Adequacy
-- Fundamental Study for Application to Japanese Speech Education -- Ikuyo Masuda-Katsuse (Kindai Univ.) SP2023-3 |
For the purpose of application to Japanese speech education, pitch patterns were modeled by Sequential VAE to obtain a f... [more] |
SP2023-3 pp.8-13 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Streaming End-to-End speech recognition using a CTC decoder with substituted linguistic information Tatsunari Takagi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka, Yukoh Wakabayashi (TUT) SP2023-12 |
Speech recognition technology has been employed in various fields due to the enhancement of speech recognition model acc... [more] |
SP2023-12 pp.60-64 |
EA, EMM, ASJ-H |
2022-11-22 13:00 |
Online |
Online |
[Fellow Memorial Lecture]
Security and Privacy Preservation for Speech Signal
-- Approach from speech information hiding technology -- Masashi Unoki (JAIST) EA2022-60 EMM2022-60 |
Non-authentic but skillfully fabricated artificial replicas of authentic media in the real world are known as “media clo... [more] |
EA2022-60 EMM2022-60 pp.99-104 |
EA, ASJ-H |
2021-07-15 16:00 |
Online |
Online |
Acoustic characteristics of a face mask invented for the purpose of not impairing speech clarity Hiroki Matsuzaki (HUS) EA2021-10 |
Due to the epidemic of the new coronavirus infection (COVID-19), wearing a mask is required in daily life to prevent the... [more] |
EA2021-10 pp.47-52 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 13:00 |
Online |
Online |
Creating of Japanese Phoneme Balanced Sentences for Speech Synthesis Yuko Takai, Naofumi Aoki, Yoshinori Dobashi (Hokkaido Univ.) SP2021-9 |
When the loss of voice is inevitable due to pharyngectomy or other reasons, it has become possible to realizespeech synt... [more] |
SP2021-9 pp.39-41 |
WIT, SP, IPSJ-SLP [detail] |
2020-10-22 13:00 |
Online |
Online |
[Invited Talk]
NHK's activities on Japanese end-to-end speech synthesis Kiyoshi Kurihara (NHK) SP2020-11 WIT2020-12 |
The main business of NHK (Japan Broadcasting Corporation) is the production and broadcasting of programs. Many programs ... [more] |
SP2020-11 WIT2020-12 pp.19-20 |
SP |
2020-01-29 11:30 |
Toyama |
|
Application of Deep Gaussian Process to Multi-Speaker Text-to-Speech Synthesis using Speaker Codes Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari (UTokyo) SP2019-49 |
Speaker codes are widely used to achieve multi-speaker text-to-speech synthesis.
Conventionally, Deep Neural Network (D... [more] |
SP2019-49 pp.31-36 |
NLP, NC (Joint) |
2020-01-25 10:10 |
Okinawa |
Miyakojima Marine Terminal |
Application of Chaotic Neural Network Reservoir to Speech Recognition Maakito Inoue, Keisuke Fukuda, Yoshihiko Horio (Tohoku Univ.) NLP2019-103 |
The neural network reservoir is a learning network model using the recurrent neural network. The chaotic neural network ... [more] |
NLP2019-103 pp.95-98 |
WIT, HI-SIGACI |
2019-12-04 14:55 |
Tokyo |
AIST Tokyo Waterfront (TBD) |
Development of language function training support system for medical welfare and education Mio Sakuma (NIT, Sendai College), Shigeharu Ono (JAIST), Chie Sakuma (Kanagami Hospital), Takahiro Yonamine (NIT, Okinawa college) WIT2019-37 |
We have developed the language function training support system using Android tablet-type devices to reduce the burden o... [more] |
WIT2019-37 pp.39-44 |
WIT, IPSJ-AAC |
2019-03-09 16:40 |
Ibaraki |
Tsukuba University of Technology |
[Special Talk]
Applications of Well Well-being Information Technology to Broadcasting Service Tomoyasu Komori (NHK STRL) WIT2018-75 |
NHK has researched and developed the human-friendly broadcasting services that are commentary broadcasting and teletext ... [more] |
WIT2018-75 pp.105-106 |
NLC, IPSJ-IFAT |
2019-02-08 13:00 |
Kyoto |
Ryukoku University Omiya Campus |
[Special Talk]
Morphological Analyzer for Business "Sudachi": the Present and Future Yoshitaka Uchida (Works Applications) NLC2018-46 |
Morphological analysis is a fundamental and important technology for processing a Japanese text, especially for industri... [more] |
NLC2018-46 p.59 |
EA, ASJ-H, ASJ-AA |
2018-07-25 13:40 |
Hokkaido |
Hokkaido Univ. |
Interference-free power spectral representations of periodic sounds and their application to VOCODERs Hideki Kawahara (Wakayama Univ.), Masanori Morise (Univ. Yamanashi), Kanru Hua (Univ. Illinois) EA2018-23 |
We propose a method to calculate the spectral envelope of voiced sounds for VOCODER applications. In our previous techni... [more] |
EA2018-23 pp.135-140 |
PRMU, SP |
2018-06-29 10:00 |
Nagano |
|
Revisiting interference-free power spectral representations of periodic signals Hideki Kawahara (Wakayama Univ.), Masanori Morise (Univ. Yamanashi), Kanru Hua (Univ. Illinois) PRMU2018-29 SP2018-9 |
We propose two algorithms to calculate interference-free power spectra of periodic signals. This set of algorithms is ou... [more] |
PRMU2018-29 SP2018-9 pp.41-46 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 13:20 |
Okinawa |
|
Speech Dereverberation Based on Recursive Weighted Prediction Error Takehiko Kagoshima, Ui-Hyun Kim, Masami Akamine (Toshiba) EA2017-169 SIP2017-178 SP2017-152 |
This paper proposes a speech dereverberation method based on recursive wighted prediction error (RWPE) for a moving aver... [more] |
EA2017-169 SIP2017-178 SP2017-152 pp.367-372 |