Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
|
We have developed automatic speech recognition and dialect identification techniques by using COJADS, a corpus of Japane... [more] |
|
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 09:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Training Dialect Speech Recognition Model using Corpus of Japanese Dialects and Self-Supervised Learning-based Model XLSR Shogo Miwa, Atsuhiko Kai (Shizuoka Univ.) EA2022-99 SIP2022-143 SP2022-63 |
(To be available after the conference date) [more] |
EA2022-99 SIP2022-143 SP2022-63 pp.141-146 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 09:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Domain Adaptation for Improving End-to-end ASR Performance of Classroom Speech with Variable Recording Condition Raufun Nahar, Rino Suzuki, Atsuhiko Kai (Shizuoka Univ.) EA2022-101 SIP2022-145 SP2022-65 |
Automatic speech recognition (ASR) of real-world speech recorded in real environment has been a challenge in the field o... [more] |
EA2022-101 SIP2022-145 SP2022-65 pp.153-158 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
Adaptation to Meeting Speech and Mitigation of Wraparound Speech for End-to-end Speech Recognition Kazua Ouchi, Atsuhiko Kai (Shizuoka Univ.) EA2019-111 SIP2019-113 SP2019-60 |
(To be available after the conference date) [more] |
EA2019-111 SIP2019-113 SP2019-60 pp.59-64 |
WIT, SP |
2017-10-20 10:40 |
Fukuoka |
Tobata Library of Kyutech (Kitakyushu) |
Low Cost Semi-automatic Correction and Adaptation Method Assuming Automatic Captioning System for Lectures Tamiya Kenta, Terada Yuji, Kai Atsuhiko (Shizuoka Univ.) SP2017-50 WIT2017-46 |
By using Automatic Speech Recognition (ASR) technology, it is possible to subtitle lecture and other voices at low cost ... [more] |
SP2017-50 WIT2017-46 pp.89-94 |
AI |
2016-02-29 13:30 |
Kyoto |
Kyoto Univ. |
Speech-enabled Parallel-text Retrieval Method using Expanded Corpus Taku Fukushima, Atsuhiko Kai (Shizuoka Univ.) AI2015-57 |
Recently, worldwide globalization has helped to increase communication among people with different native languages. How... [more] |
AI2015-57 pp.29-34 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 16:30 |
Aichi |
Nagoya Inst of Tech. |
Distant-talking speech recognition by reverberation-aware denoising autoencoder Yuma Ueda (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ.), Atsuhiko Kai (Shizuoka Univ.) SP2015-77 |
In the distant-talking speech recognition, it is essential to deal with the noise and reverberation.Denoising autoencode... [more] |
SP2015-77 pp.55-60 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 10:45 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Investigation of Deep Neural Network and Cross-adaptation for Voice Activity Detection in Meeting Speech Akihiro Nakadani (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.) SP2014-107 |
In voice activity detection(VAD), performance largely decreases under the influence of noise and reverberation. In this ... [more] |
SP2014-107 pp.19-24 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
Deep neural network-based feature transformation for reverberant speaker identification Zhaofeng Zhang, Longbiao Wang (NUT), Atsuhiko Kai (Shizuoka Univ.), Weifeng Li (Tsinghua Univ.), Masahiro Iwahashi (NUT) SP2014-119 |
(Advance abstract in Japanese is available) [more] |
SP2014-119 pp.111-116 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
speech selection and environmental adaptation for asynchronous speech recording based on deep neural network Bo Ren, Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.) SP2014-121 |
In this paper, we propose a robust distant-talking speech recognition system with asynchronous speech recording. This is... [more] |
SP2014-121 pp.129-134 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Robustness of Speaker Identification Using Pseudo Pitch Synchronized Phase Information Yuta Kawakami, Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.), Seiichi Nakagawa (Toyohashi Univ. of Tech.) SP2014-11 |
The phase information is useful for the speaker recognition task, but MFCC ignores that. In this work, we conducted spea... [more] |
SP2014-11 pp.123-126 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Distant-talking Speech Recognition with Asynchronous Speech Recording Shunta Teraoka, Yuma Ueda (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai, Taku Fukushima (Shizuoka Univ.) SP2014-16 |
Although applications using mobile terminals have attracted increasing attention, there are few studies that focus on di... [more] |
SP2014-16 pp.153-157 |
AI |
2014-02-26 14:00 |
Osaka |
|
Proposal of a Speech-enabled Parallel-text Retrieval Method for Smooth Multilingual Communication Support Taku Fukushima, Atsuhiko Kai (Shizuoka Univ.) AI2013-42 |
Recently, worldwide globalization has helped to increase communication among people with different native languages. How... [more] |
AI2013-42 pp.29-34 |
EA, SP, SIP |
2012-05-25 10:50 |
Osaka |
Osaka Univ. Nakanoshima Center |
Evaluation of Denoising and Dereverberation Based on Spectral Subtraction in Real Environment Kyohei Odani, Longbiao Wang, Atsuhiko Kai (Shizuoka Univ.) EA2012-24 SIP2012-24 SP2012-24 |
In a distant-talking environment, reverberation drastically degrades speech recognition performance. In previous work, w... [more] |
EA2012-24 SIP2012-24 SP2012-24 pp.137-142 |
EA, SIP, SP |
2011-05-12 11:15 |
Osaka |
Ritsumeikan Univ. |
Improvement of Dereverberation by Multi-channel LMS Algorithm for Distant-talking Speech Recognition Kyohei Odani, Longbiao Wang, Atsuhiko Kai (Shizuoka Univ.) EA2011-3 SIP2011-3 SP2011-3 |
In a distant-talking environment, reverberation drastically degrades speech recognition performance. In previous work, w... [more] |
EA2011-3 SIP2011-3 SP2011-3 pp.13-18 |