Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA |
2011-11-18 15:45 |
Kumamoto |
Kumamoto Univ. |
Underdetermined BSS in noisy environments with new analytical update rule for TDOA inference Takuro Maruyama (Tsukuba Univ), Shoko Araki, Tomohiro Nakatani (NTT), Shigeki Miyabe, Takeshi Yamada, Shoji Makino (Tsukuba Univ), Atsushi Nakamura (NTT) EA2011-86 |
[more] |
EA2011-86 pp.25-30 |
EA |
2011-10-28 14:45 |
Nagano |
Faculty of Engineering, Shinshu Univ. |
Perspective evaluation of virtual sound sources produced with a focusing-beam loudspeaker array Akihiko Yamakawa, Ryosuke Horiuchi, Takeshi Saitou, Masato Miyoshi (Kanazawa Univ.), Keisuke Kinoshita, Tomohiro Nakatani (NTT CS Labs.) EA2011-71 |
We previously reported that the sound image of a focal point was steerable through our focusing-beam type loudspeaker-ar... [more] |
EA2011-71 pp.19-24 |
EA, SIP, SP |
2011-05-12 10:50 |
Osaka |
Ritsumeikan Univ. |
A Robust On-line Estimation Method of Noise Mixture Model for Noise Suppression Masakiyo Fujimoto, Tomohiro Nakatani, Shinji Watanabe (NTT) EA2011-2 SIP2011-2 SP2011-2 |
In this paper, we propose a robust on-line estimation method of noise mixture model for the statistical model-based nois... [more] |
EA2011-2 SIP2011-2 SP2011-2 pp.7-12 |
EA, SIP, SP |
2011-05-12 15:00 |
Osaka |
Ritsumeikan Univ. |
[Invited Talk]
Microphone array speech processing techniques for conversation scene analysis Shoko Araki, Masakiyo Fujimoto, Takuya Yoshioka, Takaaki Hori, Tomohiro Nakatani (NTT) EA2011-15 SIP2011-15 SP2011-15 |
Recognition and understanding of conversation scenes has recently been tackled to achieve a variety of tasks such as aut... [more] |
EA2011-15 SIP2011-15 SP2011-15 pp.83-88 |
SP |
2011-01-27 16:30 |
Kyoto |
NICT |
Integrated approach of spectrum enhancement and feature compensation for noise reduction and robust speech recognition Takuya Yoshioka, Tomohiro Nakatani (NTT) SP2010-107 |
[more] |
SP2010-107 pp.25-30 |
EA, US (Joint) |
2011-01-21 16:00 |
Kyoto |
Doshisha Univ. |
On perspective control of sound images using a focusing-beam Akihiko Yamakawa, Ryosuke Horiuchi, Kiyoshi Nishikawa, Takeshi Saitou, Masato Miyoshi (Kanazawa Univ.), Keisuke Kinoshita, Tomohiro Nakatani, Gabriel Pablo Nava (NTT CS Labs.) EA2010-124 |
We are studying on perspective information of sound images given by virtual sound sources produced in front of and behin... [more] |
EA2010-124 pp.113-118 |
NLC, SP (Joint) [detail] |
2010-12-20 16:30 |
Tokyo |
National Olympics Memorial Youth Center |
Noise suppression method based on noise bias-residual decomposition and optimization Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani (NTT Corp.) NLC2010-18 SP2010-91 |
In this paper, we propose a non-stationary noise estimation method based on bias-residual component decomposition, and s... [more] |
NLC2010-18 SP2010-91 pp.43-48 |
SP, EA, SIP |
2010-05-26 14:55 |
Hyogo |
Konan Univ. (Hirao Seminar House) |
[Invited Talk]
Recent advances in blind speech dereverberation Keisuke Kinoshita, Takuya Yoshioka, Tomohiro Nakatani (NTT CS labs.) EA2010-5 SIP2010-5 SP2010-5 |
A speech signal captured by a distant microphone inevitably contains reverberant components due to reflection from, for ... [more] |
EA2010-5 SIP2010-5 SP2010-5 pp.25-30 |
SP, NLC |
2008-12-09 10:50 |
Tokyo |
Waseda Univ. |
Noisy speech recognition using integrated method of statistical model-based voice activity detection and noise suppression Masakiyo Fujimoto, Kentaro Ishizuka, Tomohiro Nakatani (NTT Corporation) NLC2008-26 SP2008-81 |
This paper addresses robust front-end processing for automatic speech recognition in noise. The proposed method integrat... [more] |
NLC2008-26 SP2008-81 pp.13-18 |
SP, NLC |
2008-12-09 11:40 |
Tokyo |
Waseda Univ. |
Speaker diarization of multi-party conversations based on audio and visual information integration Kentaro Ishizuka, Shoko Araki, Kazuhiro Otsuka, Masakiyo Fujimoto, Tomohiro Nakatani (NTT) NLC2008-28 SP2008-83 |
This paper proposes a speaker diarization method, which detects “who spoke when” in multi-party conversations, based on ... [more] |
NLC2008-28 SP2008-83 pp.25-30 |
EA |
2008-07-18 14:45 |
Nara |
|
Speaker diarization for meetings by integrating speech presence probability estimation and time-frequency domain direction of arrival estimation Shoko Araki, Masakiyo Fujimoto, Kentaro Ishizuka, Tomohiro Nakatani, Hiroshi Sawada, Shoji Makino (NTT) EA2008-40 |
This paper presents a meeting diarization system that estimates who spoke when in a meeting. Our proposed system is real... [more] |
EA2008-40 pp.19-24 |
EA |
2008-07-18 16:30 |
Nara |
|
Frequency Domain Speech Dereverberation with Crossband EffectCcompensation Tomohiro Nakatani, Takuya Yoshioka, Keisuke Kinoshita, Masato Miyoshi (NTT), Biing-Hwang Juang (Georgia Inst. of Tech.) EA2008-43 |
It has recently been shown that the maximum likelihood estimation
approach with a time-varying source model is very ef... [more] |
EA2008-43 pp.37-42 |
SP |
2008-07-17 - 2008-07-19 |
Iwate |
Iwate Prefectural Univ. |
An evaluation and an examination of integration method of statistical model-based voice activity detection and noise suppression Masakiyo Fujimoto, Kentaro Ishizuka, Tomohiro Nakatani (NTT CS Labs.) SP2008-45 |
This paper addresses robust front-end processing for automatic speech recognition (ASR) in noisy environments.
Usually... [more] |
SP2008-45 pp.13-18 |