Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-01 14:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Target speaker extraction based on conditional variational autoencoder and directional information in underdetermined condition Rui Wang, Li Li, Tomoki Toda (Nagoya Univ) EA2021-76 SIP2021-103 SP2021-61 |
This paper deals with a dual-channel target speaker extraction problem in underdetermined conditions. A blind source sep... [more] |
EA2021-76 SIP2021-103 SP2021-61 pp.76-81 |
SeMI |
2022-01-21 10:00 |
Nagano |
(Primary: On-site, Secondary: Online) |
[Short Paper]
Dementia Detection Using Two Perplexities Methods with Part-of-Speech Tags Chuheng Zheng, Mondher Bouazizi, Tomoaki Ohtsuki (Keio Univ.) SeMI2021-77 |
Alzheimer’s disease is a kind of dementia that causes problems with memory, thinking, and behavior. Using automated comp... [more] |
SeMI2021-77 pp.98-102 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-18 13:00 |
Online |
Online |
F0 estimation of speech based on l2-norm regularized TV-CAR analysis Keiichi Funaki (Univ. of the Ryukyus) SP2021-2 |
Linear Prediction (LP) is the most successful speech analysis in speech processing, including speech coding implemented
... [more] |
SP2021-2 pp.7-12 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-18 15:00 |
Online |
Online |
Protection method with audio processing against Audio Adversarial Example Taisei Yamamoto, Yuya Tarutani, Yukinobu Fukusima, Tokumi Yokohira (Okayama Univ) SP2021-4 |
Machine learning technology has improved the recognition accuracy of voice recognition, and demand for voice recognition... [more] |
SP2021-4 pp.19-24 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 09:30 |
Online |
Online |
[Invited Talk]
Toward a Unification of Various Speech Processing Tasks Based on End-to-End Neural networks Shinji Watanabe (CMU) SP2021-8 |
This presentation will introduce the recent progress of speech processing technologies based on end-to-end neural networ... [more] |
SP2021-8 p.38 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-04 16:10 |
Online |
Online |
Estimation of imagined speech from electrocorticogram with an encoder-decoder model Kotaro Hayashi, Shuji Komeiji (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (TokyoTech), Toshihisa Tanaka (TUAT) EA2020-87 SIP2020-118 SP2020-52 |
Recent advances in signal processing and machine learning technologies have made it possible to estimate and reconstruct... [more] |
EA2020-87 SIP2020-118 SP2020-52 pp.164-169 |
SP, EA, SIP |
2020-03-02 09:20 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
Investigation of neural speech rate conversion with multi-speaker WaveNet vocoder Takuma Okamoto (NICT), Keisuke Matsubara (Kobe Univ./NICT), Tomoki Toda (Nagoya Univ./NICT), Yoshinori Shiga, Hisashi Kawai (NICT) EA2019-101 SIP2019-103 SP2019-50 |
Speech rate conversion technology, which can expand or compress speech waveforms without changing pitch of sound, is con... [more] |
EA2019-101 SIP2019-103 SP2019-50 pp.1-6 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
Data augmentation for ASR system by using locally time-reversed speech
-- Temporal inversion of feature sequence -- Takanori Ashihara, Tomohiro Tanaka, Takafumi Moriya, Ryo Masumura, Yusuke Shinohara, Makio Kashino (NTT) EA2019-110 SIP2019-112 SP2019-59 |
Data augmentation is one of the techniques to mitigate overfitting and improve robustness against several acoustic varia... [more] |
EA2019-110 SIP2019-112 SP2019-59 pp.53-58 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
A Study for HMM-based embedded speech synthesis using a large-scale speech corpus Nobuyuki Nishizawa, Tomohiro Obara, Hiromi Ishizaki (KDDI Research, Inc.) EA2019-141 SIP2019-143 SP2019-90 |
This study shows that our speech synthesis system based on HMM speech synthesis for embedded devices can perform real-ti... [more] |
EA2019-141 SIP2019-143 SP2019-90 pp.231-236 |
EA |
2019-12-12 14:25 |
Fukuoka |
Kyushu Inst. Tech. |
Performance improvement of speech enhancement network by multitask learning including noise information Haruki Tanaka (NITTC), Yosuke Sugiura, Nozomiko Yasui, Tetsuya Shimamura (Saitama Univ.), Ryoichi Miyazaki (NITTC) EA2019-70 |
In the signal processing field, there is a growing interest in speech enhancement.Recently, a lot of speech enhancement ... [more] |
EA2019-70 pp.31-36 |
ISEC, SITE, ICSS, EMM, HWS, BioX, IPSJ-CSEC, IPSJ-SPT [detail] |
2019-07-24 12:10 |
Kochi |
Kochi University of Technology |
Recording device identification based on audio distortion depending on system-on-chip Akira Nishimura (Tokyo Univ. Info. Sci.) ISEC2019-48 SITE2019-42 BioX2019-40 HWS2019-43 ICSS2019-46 EMM2019-51 |
This study addresses device-specific distortion observed in recorded
audio, to identify a built-in system-on-a-chip (... [more] |
ISEC2019-48 SITE2019-42 BioX2019-40 HWS2019-43 ICSS2019-46 EMM2019-51 pp.311-316 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
MVDR beamformer based on time-frequency-bin-wise switching technique for underdetermined speech enhancement Kouei Yamaoka (Univ. of Tsukuba), Nobutaka Ono (Tokyo Metropolitan Univ.), Shoji Makino, Takeshi Yamada (Univ. of Tsukuba) EA2018-124 SIP2018-130 SP2018-86 |
In this paper, we present an underdetermined speech enhancement method called the time-frequency-bin-wise switching beam... [more] |
EA2018-124 SIP2018-130 SP2018-86 pp.149-154 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
F0 estimation using TV-CAR speech analysis based on Regularized LP Keiichi Funaki (Univ. of the Ryukyus) EA2018-152 SIP2018-158 SP2018-114 |
Linear Prediction (LP) analysis is speech analysis to estimate AR(Auto-Regressive) coefficients to represent the all-pol... [more] |
EA2018-152 SIP2018-158 SP2018-114 pp.311-316 |
NLC, IPSJ-IFAT |
2019-02-07 15:30 |
Kyoto |
Ryukoku University Omiya Campus |
[Invited Talk]
ForeSight Voice Mining, a voice mining system for contact centers Kazuhiro Arai (NTT-TX) NLC2018-40 |
This paper describes ForeSight Voice Mining that NTT TechnoCross Corp. provides for contact centers. ForeSight Voice Min... [more] |
NLC2018-40 pp.27-32 |
NLC, IPSJ-IFAT |
2019-02-08 13:00 |
Kyoto |
Ryukoku University Omiya Campus |
[Special Talk]
Morphological Analyzer for Business "Sudachi": the Present and Future Yoshitaka Uchida (Works Applications) NLC2018-46 |
Morphological analysis is a fundamental and important technology for processing a Japanese text, especially for industri... [more] |
NLC2018-46 p.59 |
SP |
2019-01-27 10:40 |
Ishikawa |
Kanazawa-Harmonie |
Evaluation of end-to-end speech synthesis method using speaking styles Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano, Atsushi Imai (NHK) SP2018-58 |
The purpose of this study was to conduct end-to-end text-to-speech synthesis in Japanese; we developed a system that use... [more] |
SP2018-58 pp.29-34 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2018-12-10 16:30 |
Tokyo |
Waseda Univ. Nishiwaseda Campus |
Evaluation of Japanese end-to-end speech synthesis method inputting kana and prosodic symbols Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano, Atsushi Imai (NHK) SP2018-49 |
The purpose of this study was to conduct end-to-end text-to-speech synthesis in Japanese; we developed a system that use... [more] |
SP2018-49 pp.89-94 |
AI |
2018-12-07 15:55 |
Fukuoka |
|
Toyoaki Kuwahara, Yuichi Sei, Yasuyuki Tahara, Akihiko Ohsuga (UEC) AI2018-30 |
The emotion estimation by speech makes it possible to estimate with higher precision with the development of deep learni... [more] |
AI2018-30 pp.25-29 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-21 13:30 |
Ishikawa |
Hotel Koshuen |
Evaluation of DNN-based Low-Musical-Noise Speech Enhancement Using Kurtosis Matching Satoshi Mizoguchi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2018-66 EMM2018-66 |
This paper proposes DNN-based speech enhancement with low musical noise by kurtosis matching. Musical noise, artifacts g... [more] |
EA2018-66 EMM2018-66 pp.19-24 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-22 10:00 |
Ishikawa |
Hotel Koshuen |
[Invited Talk]
Phase reconstruction for speech enhancement and its effect on array processing Yukoh Wakabayashi (TMU) EA2018-80 EMM2018-80 |
Phase spectrum processing for speech enhancement, so called ``phase reconstruction,'' has been particularly received att... [more] |
EA2018-80 EMM2018-80 pp.163-168 |