Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2024-06-14 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
Computation of acoustic field in the vicinity of the wedge-shaped cut imitating the lips by using FDTD method Chiune Sato, Kunitoshi Motoki (Hokkai-Gakuen Univ.) |
(To be available after the conference date) [more] |
|
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2024-06-15 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
A voice synthesizer operated by fingers to control its vocal-tract area function. Amane Koriki, Masashi Ito (Tohtech) |
(To be available after the conference date) [more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 16:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Multiple Lag Window Pairs for Estimation of Fundamental Frequency and Periodicity Measure Michiki Koshimori (UEC), Shigeki Sagayama (UTokyo/UEC), Toru Nakashika (UEC) EA2023-75 SIP2023-122 SP2023-57 |
Extending the main concept of modified autocorrelation method in LPC, we investigate lag windows, lag window pairs, and ... [more] |
EA2023-75 SIP2023-122 SP2023-57 pp.85-90 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Improving training recipe of Remixed2Remixed for speech enhancement Li Li, Shogo Seki (CyberAgent) EA2023-95 SIP2023-142 SP2023-77 |
In the use of deep learning for speech enhancement, supervised learning models that use pairs of clean speech and artifi... [more] |
EA2023-95 SIP2023-142 SP2023-77 pp.202-207 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Intermediate speaker speech synthesis between two speakers using x-vector speaker space Sota Hosoi, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka (TUT) EA2023-103 SIP2023-150 SP2023-85 |
Recent advancements in speech synthesis technologies have enabled the synthesis of speeches of speakers not in the train... [more] |
EA2023-103 SIP2023-150 SP2023-85 pp.250-255 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 16:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Discrimination of rotation direction of virtual sound source in binaural synthesis using sound source radiation characteristics Orie Nishiyama (Chiba Institute of Technology), Toshiharu Horiuchi, Shota Okubo (KDDI Research, Inc.), Yoshifumi Chisaki (Chiba Institute of Technology) EA2023-125 SIP2023-172 SP2023-107 |
In order to provide the sensation of being there, research has been conducted on realistic communication that acquires, ... [more] |
EA2023-125 SIP2023-172 SP2023-107 pp.376-381 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 16:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluations of Multi-channel Blind Source Separation for Speech Recognition in Car Environments Yutsuki Takeuchi, Natsuki Ueno, Nobutaka Ono (Tokyo Metropolitan Univ.), Takashi Takazawa, Shuhei Shimanoe, Tomoki Tanemura (MIRISE Technologies) EA2023-127 SIP2023-174 SP2023-109 |
In car environments, speech recognition is difficult due to various types of noise. For this issue, speech enhancement b... [more] |
EA2023-127 SIP2023-174 SP2023-109 pp.388-393 |
EMM, EA, ASJ-H |
2023-11-23 15:45 |
Toyama |
|
[Invited Talk]
Auditory representation effective for extracting speech information: Theory, measurement, estimation, and applications Toshio Irino (Wakayama Univ.) |
Just by listening to the voice on a telephone, we can immediately tell whether the caller is an adult or a child, and we... [more] |
EA2023-46 EMM2023-77 pp.98-103 |
ET |
2023-07-14 13:10 |
Hokkaido |
Muroran Institute of Technology / Online (Primary: On-site, Secondary: Online) |
English Pronunciation Practice Using the Speech Recognition Function Katsuyuki Umezawa (Shonan Inst. of Tech.), Makoto Nakazawa (Junior College of Aizu), Michiko Nakano, Shigeichi Hirasawa (Waseda Univ.) ET2023-9 |
The development of the AI field in recent years has been remarkable, and the speech recognition function has become wide... [more] |
ET2023-9 pp.1-6 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Representation and Prediction of Accent Phrase Prosodic Features in Japanese Text-to-Speech Masaki Sato, Shinnosuke Takamichi, Hiroshi Saruwatari (The Univ. of Tokyo) EA2022-108 SIP2022-152 SP2022-72 |
In order to use speech synthesis in a variety of situations such as dialogue systems and emotional expression in audiobo... [more] |
EA2022-108 SIP2022-152 SP2022-72 pp.197-202 |
HCS |
2022-08-27 15:15 |
Hyogo |
(Primary: On-site, Secondary: Online) |
A Study of Feedback Methods for Speakers in Speech Rate Converted Conversation
-- Comparative evaluation for adaptive switching between audio feedback and visual feedback -- Kazuma Ban (Tokyo Denki Univ.), Hiroko Tokunaga (Tokyo Denki Univ./RIKEN), Naoki Mukawa, Hiroto Saito (Tokyo Denki Univ.) HCS2022-47 |
Speech rate conversion is a useful technique for people who need assistance in listening comprehension and non-native sp... [more] |
HCS2022-47 pp.61-66 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 13:00 |
Online |
Online |
Speech intelligibility prediction of simulated hearing loss sounds using the Gammachirp Envelope Similarity Index (GESI)
-- Subjective data from laboratory and crowdsourced remote experiments -- Toshio Irino, Honoka Tamaru, Ayako Yamamoto (Wakayama Univ.) SP2022-17 |
We aim at developing an objective intelligibility measure (OIM) to predict speech intelligibility (SI) for individual el... [more] |
SP2022-17 pp.71-76 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 15:00 |
Online |
Online |
Improved speech analysis using F0-adaptive lag window Michiki Koshimori, Shigeki Sagayama, Takuya Kishida, Toru Nakashika (UEC) SP2022-21 |
The lag window method is based on a source-filter model, which separates the source information from the filter informat... [more] |
SP2022-21 pp.90-93 |
HCS |
2022-03-12 10:10 |
Online |
Online |
Evaluation of Feedback Methods for Speakers in Speech Rate Converted Conversation Tamami Mizuta, Hiroko Tokunaga, Naoki Mukawa, Hiroto Saito (Tokyo Denki Univ.) HCS2021-70 |
This study clarifies the characteristics of voice feedback and visual feedback, which are support functions for speakers ... [more] |
HCS2021-70 pp.55-60 |
WIT, HI-SIGACI |
2021-12-09 16:25 |
Online |
Online |
Significance of the publication of "Speech communication and people with disabilities" Akira Ichikawa (Chiba Univ.), Yuji Nagashima (Kogakuin Univ.), Akira Okamoto (Tsukuba University of Technology), Naoto Kato (i Univ.), Shinji Sako (NITech), Testuya Takiguchi (Kobe Univ.), Daisuke Hara (Toyota Technological Institute), Michiru Makuuchi (National Rehabilitation Center For Persons with Disabilities) WIT2021-42 |
The book we authored, "Speech Communication and People with Disabilities," (edited by Acoustical Society of Japan, Acous... [more] |
WIT2021-42 pp.54-57 |
EA, ASJ-H |
2021-07-15 16:00 |
Online |
Online |
Acoustic characteristics of a face mask invented for the purpose of not impairing speech clarity Hiroki Matsuzaki (HUS) EA2021-10 |
Due to the epidemic of the new coronavirus infection (COVID-19), wearing a mask is required in daily life to prevent the... [more] |
EA2021-10 pp.47-52 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 13:00 |
Online |
Online |
Creating of Japanese Phoneme Balanced Sentences for Speech Synthesis Yuko Takai, Naofumi Aoki, Yoshinori Dobashi (Hokkaido Univ.) SP2021-9 |
When the loss of voice is inevitable due to pharyngectomy or other reasons, it has become possible to realizespeech synt... [more] |
SP2021-9 pp.39-41 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 13:05 |
Online |
Online |
[Invited Talk]
* Masahito Togami (LINE) EA2020-64 SIP2020-95 SP2020-29 |
Recently, deep learning based speech source separation has been evolved rapidly. A neural network (NN) is usually learne... [more] |
EA2020-64 SIP2020-95 SP2020-29 pp.27-32 |
SIS, IPSJ-AVM, ITE-3DMT [detail] |
2020-06-04 14:00 |
Online |
Online |
An experimental comparison of CNN- and CRNN-CTC for automatic phrase speech recognition systems using a children's speech database Yunzhe Wang, Yu Tian (Hokkaido Univ.), Yoshikazu Miyanaga (CIST), Hiroshi Tsutsui (Hokkaido Univ.) SIS2020-9 |
Children's speech recognition is still a challenging issue. In the case of children's speeches, the accuracy of conventi... [more] |
SIS2020-9 pp.49-54 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
EEG-Based Estimation of Attentional Direction while Simultaneously Listening to Music and Speech Ryosuke Matsui, Toshihisa Tanaka (TUAT) EA2019-155 SIP2019-157 SP2019-104 |
We can selectively focus our auditory attention to a particular speech or music.The function is called selective attenti... [more] |
EA2019-155 SIP2019-157 SP2019-104 pp.313-318 |