Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, ASJ-H |
2019-08-09 10:30 |
Miyagi |
Tohoku Univ. |
Study on Robust Method for Blindly Estimating Speech Transmission Index using Convolutional Neural Network with Temporal Amplitude Envelope Suradej Doungpummet (JAIST), Jessada Karunjana (NASDA), Waree Kongprawechnon (SIIT), Masashi Unoki (JAIST) EA2019-30 |
We have developed a robust scheme for blindly estimating speech transmission index (STI) in noisy reverberant environmen... [more] |
EA2019-30 pp.47-52 |
ISEC, SITE, ICSS, EMM, HWS, BioX, IPSJ-CSEC, IPSJ-SPT [detail] |
2019-07-24 12:10 |
Kochi |
Kochi University of Technology |
Recording device identification based on audio distortion depending on system-on-chip Akira Nishimura (Tokyo Univ. Info. Sci.) ISEC2019-48 SITE2019-42 BioX2019-40 HWS2019-43 ICSS2019-46 EMM2019-51 |
This study addresses device-specific distortion observed in recorded
audio, to identify a built-in system-on-a-chip (... [more] |
ISEC2019-48 SITE2019-42 BioX2019-40 HWS2019-43 ICSS2019-46 EMM2019-51 pp.311-316 |
EA, SIP, SP |
2019-03-14 10:25 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
Blind speech separation based on approximate joint diagonalization utilizing correlation between neighboring frequency bins Taiki Asamizu, Toshihiro Furukawa (TUS) EA2018-100 SIP2018-106 SP2018-62 |
In this paper, we propose a new method that extends the approximate joint diagonalization blind speech separation (BSS).... [more] |
EA2018-100 SIP2018-106 SP2018-62 pp.7-12 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Initial analysis of emotional speech acted in noise Yi Zhao (NII), Atsushi Ando (NTT), Shinji Takaki, Junichi Yamagishi (NII), Satoshi Kobashikawa (NTT) EA2018-120 SIP2018-126 SP2018-82 |
Speakers usually adjust their way of talking in noisy environments involuntarily for effective communication, this adapt... [more] |
EA2018-120 SIP2018-126 SP2018-82 pp.125-130 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
MVDR beamformer based on time-frequency-bin-wise switching technique for underdetermined speech enhancement Kouei Yamaoka (Univ. of Tsukuba), Nobutaka Ono (Tokyo Metropolitan Univ.), Shoji Makino, Takeshi Yamada (Univ. of Tsukuba) EA2018-124 SIP2018-130 SP2018-86 |
In this paper, we present an underdetermined speech enhancement method called the time-frequency-bin-wise switching beam... [more] |
EA2018-124 SIP2018-130 SP2018-86 pp.149-154 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
F0 estimation using TV-CAR speech analysis based on Regularized LP Keiichi Funaki (Univ. of the Ryukyus) EA2018-152 SIP2018-158 SP2018-114 |
Linear Prediction (LP) analysis is speech analysis to estimate AR(Auto-Regressive) coefficients to represent the all-pol... [more] |
EA2018-152 SIP2018-158 SP2018-114 pp.311-316 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-21 13:30 |
Ishikawa |
Hotel Koshuen |
Evaluation of DNN-based Low-Musical-Noise Speech Enhancement Using Kurtosis Matching Satoshi Mizoguchi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2018-66 EMM2018-66 |
This paper proposes DNN-based speech enhancement with low musical noise by kurtosis matching. Musical noise, artifacts g... [more] |
EA2018-66 EMM2018-66 pp.19-24 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-22 10:00 |
Ishikawa |
Hotel Koshuen |
[Invited Talk]
Phase reconstruction for speech enhancement and its effect on array processing Yukoh Wakabayashi (TMU) EA2018-80 EMM2018-80 |
Phase spectrum processing for speech enhancement, so called ``phase reconstruction,'' has been particularly received att... [more] |
EA2018-80 EMM2018-80 pp.163-168 |
WIT, SP |
2018-10-27 13:50 |
Fukuoka |
Kyushu Institute of Technology(Kitakyushu) |
Proposal of Esophageal Speech Training Device with Myoelectric Signal
-- Identification of Myoelectric Signal Detection Spot for Training Device -- Katsutoshi Oe (DIT), Ryoya Nakamura (Kyutech), Kazutaka Hosokawa (DIT) SP2018-34 WIT2018-22 |
The patients who undergo the laryngectomy lose their voice. One of the speech production substitutes that are used by vo... [more] |
SP2018-34 WIT2018-22 pp.13-16 |
IEE-CMN, EMM, LOIS, IE, ITE-ME [detail] |
2018-09-28 13:40 |
Oita |
Beppu Int'l Convention Ctr. aka B-CON Plaza |
Study on speech representation for speech fingerprint using perceptual matching-pursuit algorithm Dung Kim Tran, Huy Quoc Nguyen, Masashi Unoki (JAIST) LOIS2018-20 IE2018-40 EMM2018-59 |
Recent studies have revealed the weakness of audio fingerprinting methods in speech signals. The problem is that spectro... [more] |
LOIS2018-20 IE2018-40 EMM2018-59 pp.71-76 |
EA, ASJ-H |
2018-08-23 12:55 |
Miyagi |
Tohoku Gakuin Univ. |
Self-produced speech enhancement and suppression method with wearable air- and body-conductive microphones Moe Takada, Shogo Seki, Tomoki Toda (Nagoya Univ.) EA2018-29 |
This paper presents a self-produced speech enhancement and suppression method for multichannel signals recorded with bot... [more] |
EA2018-29 pp.7-12 |
SP, IPSJ-SLP (Joint) |
2018-07-26 16:15 |
Shizuoka |
Sago-Royal-Hotel (Hamamatsu) |
Ladder Network Driven from Auditory Computational Model for Multi-talker Speech Separation Hiroshi Sekiguchi, Yoshiaki Narusue, Hiroyuki Morikawa (Univ. of Tokyo) SP2018-18 |
This paper introduces ladder network implementation induced by auditory computational model for multi-talker speech sepa... [more] |
SP2018-18 pp.9-13 |
SP, IPSJ-SLP (Joint) |
2018-07-26 16:45 |
Shizuoka |
Sago-Royal-Hotel (Hamamatsu) |
Single channel noisy speech recognition based on combination of noisy speech and enhanced speech Masakiyo Fujimoto, Hisashi Kawai (NICT) SP2018-19 |
In many cases, single channel speech enhancement seriously deteriorates speech recognition accuracy due to the influence... [more] |
SP2018-19 pp.15-20 |
EA, ASJ-H, ASJ-AA |
2018-07-25 13:40 |
Hokkaido |
Hokkaido Univ. |
Interference-free power spectral representations of periodic sounds and their application to VOCODERs Hideki Kawahara (Wakayama Univ.), Masanori Morise (Univ. Yamanashi), Kanru Hua (Univ. Illinois) EA2018-23 |
We propose a method to calculate the spectral envelope of voiced sounds for VOCODER applications. In our previous techni... [more] |
EA2018-23 pp.135-140 |
PRMU, SP |
2018-06-29 10:00 |
Nagano |
|
Revisiting interference-free power spectral representations of periodic signals Hideki Kawahara (Wakayama Univ.), Masanori Morise (Univ. Yamanashi), Kanru Hua (Univ. Illinois) PRMU2018-29 SP2018-9 |
We propose two algorithms to calculate interference-free power spectra of periodic signals. This set of algorithms is ou... [more] |
PRMU2018-29 SP2018-9 pp.41-46 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
A Hybrid Approach on Electrolaryngeal Speech Enhancement based on Spectral Differential Features and Noise Suppression Mohammad Eshghi, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) EA2017-141 SIP2017-150 SP2017-124 |
This work presents a hybrid approach for enhancing the quality of the electrolaryngeal (EL) speech. Current hybrid enhan... [more] |
EA2017-141 SIP2017-150 SP2017-124 pp.221-226 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
Speech Enhancement Using Non-Local Means Kyohei Mitani, Yosuke Sugiura, Tetsuya Shimamura (Saitama Univ.) EA2017-142 SIP2017-151 SP2017-125 |
In this paper, we propose a speech enhance method using Non-Local Means. In the proposed method, we focus on Non-Local M... [more] |
EA2017-142 SIP2017-151 SP2017-125 pp.227-230 |
MBE, NC (Joint) |
2018-03-14 13:10 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. |
Silent Japanese Single Syllable Recognition using Similarities of Muscular Activity Transitions between EMG Channels Hidetoshi Nagai (Kyutech) MBE2017-104 |
In inaudible speech recognition based on surface EMG, recognition of consonants is one of the important tasks. When utte... [more] |
MBE2017-104 pp.119-124 |
WIT, IPSJ-AAC |
2018-03-10 09:30 |
Ibaraki |
Tsukuba University of Technology |
Development of text preparation application using speech recognition with array signal processing and gaze detection Tatsuya Igarasi, Ryoichi Miyazaki (NITTC) WIT2017-79 |
Voice input is useful the upper limb disabled people and in situation in the situation the use the keyboard is restricte... [more] |
WIT2017-79 pp.115-119 |
EA |
2018-02-16 13:10 |
Hiroshima |
Pref. Univ. Hiroshima |
The effect of increasing the number of channels with multi-channel non-negative matrix factorization for noisy speech recognition Takanobu Uramoto (Oita Univ.), Youhei Okato, Toshiyuki Hanazawa (Mitsubishi Electric), Iori Miura, Shingo Uenohara, Ken'ich Furuya (Oita Univ.) EA2017-99 |
Nonnegative Matrix Factorization (NMF) factorizes a non-negative matrix into two non-negative matrices. In the field of ... [more] |
EA2017-99 pp.33-38 |