Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, NLC |
2008-12-09 10:00 |
Tokyo |
Waseda Univ. |
Two-channel input speech recognition using sparseness-based blind source separation Kenta Nishiki, Yosuke Izumi (Univ. of Tokyo), Shinji Watanabe (NTT), Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama (Univ. of Tokyo) NLC2008-24 SP2008-79 |
This paper discusses two-channel input speech recognition using sparseness-based blind source separation. The target ... |
NLC2008-24 SP2008-79 pp.1-6 |
SP, NLC |
2008-12-09 10:25 |
Tokyo |
Waseda Univ. |
Hands-free speech recognition system for robot Kosuke Hosoya, Tetsuji Ogawa, Shinya Fujie, Daichi Watanabe, Yuhi Ichikawa, Hikaru Taniyama, Tetsunori Kobayashi (Waseda Univ.) NLC2008-25 SP2008-80 |
|
NLC2008-25 SP2008-80 pp.7-12 |
SP, NLC |
2008-12-09 10:50 |
Tokyo |
Waseda Univ. |
Noisy speech recognition using integrated method of statistical model-based voice activity detection and noise suppression Masakiyo Fujimoto, Kentaro Ishizuka, Tomohiro Nakatani (NTT Corporation) NLC2008-26 SP2008-81 |
This paper addresses robust front-end processing for automatic speech recognition in noise. The proposed method integrat... |
NLC2008-26 SP2008-81 pp.13-18 |
SP, NLC |
2008-12-09 11:15 |
Tokyo |
Waseda Univ. |
Music suppression method for single channel speech mixed with BGM using Bayesian networks Hiroaki Itou, Takanori Nishino, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) NLC2008-27 SP2008-82 |
A non-parametric stochastic method of the single-channel speech extraction from a mixture of speech and music is propose... |
NLC2008-27 SP2008-82 pp.19-24 |
SP, NLC |
2008-12-09 11:40 |
Tokyo |
Waseda Univ. |
Speaker diarization of multi-party conversations based on audio and visual information integration Kentaro Ishizuka, Shoko Araki, Kazuhiro Otsuka, Masakiyo Fujimoto, Tomohiro Nakatani (NTT) NLC2008-28 SP2008-83 |
This paper proposes a speaker diarization method, which detects “who spoke when” in multi-party conversations, based on ... |
NLC2008-28 SP2008-83 pp.25-30 |
SP, NLC |
2008-12-09 13:10 |
Tokyo |
Waseda Univ. |
[Invited Talk] Cognitive competence required for spoken language performance and computational competence realized by spoken language engineering Nobuaki Minematsu (Univ. of Tokyo) NLC2008-29 SP2008-84 |
The performance and flexibility of speech recognition and synthesis technologies have been remarkably enhanced by introd... |
NLC2008-29 SP2008-84 pp.31-36 |
SP, NLC |
2008-12-09 14:10 |
Tokyo |
Waseda Univ. |
Acoustic Model Training Technique for Speech Recognition using Style Estimation with Multiple-Regression HMM Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2008-30 SP2008-85 |
We propose a technique for emotional speech recognition based on multiple-regression HMM (MRHMM). To achieve emotional s... |
NLC2008-30 SP2008-85 pp.37-42 |
SP, NLC |
2008-12-09 14:35 |
Tokyo |
Waseda Univ. |
Speech Feature Extraction Using Constrained Nonnegative Matrix Factorization Hyunsin Park, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) NLC2008-31 SP2008-86 |
In this paper, we propose a speech feature extraction approach using nonnegative matrix factorization (NMF), and a new in... |
NLC2008-31 SP2008-86 pp.43-48 |
SP, NLC |
2008-12-09 15:10 |
Tokyo |
Waseda Univ. |
Evaluation of annealing schedule for PLSA language model adaptation Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.), Akinori Ito, Shozo Makino (Tohoku Univ.) NLC2008-32 SP2008-87 |
Probabilistic Latent Semantic Analysis (PLSA) is a powerful statistical language model. However, the PLSA has the loca... |
NLC2008-32 SP2008-87 pp.49-53 |
SP, NLC |
2008-12-09 15:35 |
Tokyo |
Waseda Univ. |
Speech Recognition by Topic Models with Continuous/Discontinuous Topic Changes Atsushi Sako, Yasuo Ariki (Kobe Univ.), Tomoharu Iwata, Shinji Watanabe, Takaaki Hori (NTT) NLC2008-33 SP2008-88 |
In this paper, we propose topic models with continuous/discontinuous topic changes, and describe experiments using MIT... |
NLC2008-33 SP2008-88 pp.55-60 |
SP, NLC |
2008-12-09 16:00 |
Tokyo |
Waseda Univ. |
User modeling for a satisfaction evaluation of a speech recognition system Sunao Hara, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) NLC2008-34 SP2008-89 |
A mathematical model for predicting the user satisfaction of a speech dialogue system is studied based on a field trial... |
NLC2008-34 SP2008-89 pp.61-66 |
SP, NLC |
2008-12-10 09:30 |
Tokyo |
Waseda Univ. |
Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features Katsuyoshi Setoyama (Nara Institute of Science and Technology), Hideki Kashioka, Nick Campbell (Nara Institute of Science and Technology/National Institute of I) NLC2008-35 SP2008-90 |
It is common for speech synthesis technology to process each sentence as one single and independent unit. However, in hu... |
NLC2008-35 SP2008-90 pp.67-72 |
SP, NLC |
2008-12-10 09:55 |
Tokyo |
Waseda Univ. |
Bayesian Context Clustering Using Cross Validation for HMM-Based Speech Synthesis Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Institute of Technology) NLC2008-36 SP2008-91 |
This paper proposes a prior distribution determination technique using cross validation for HMM-based speech synthesis b... |
NLC2008-36 SP2008-91 pp.73-78 |
SP, NLC |
2008-12-10 10:20 |
Tokyo |
Waseda Univ. |
Simultaneous Transformation of Duration and Spectrum Using Statistical Models Including Time-Sequence Matching Kaori Yutani, Yoshihiko Nankaku (Nagoya Institute of Technology), Tomoki Toda (Nara Institute of Science and Technology), Keiichi Tokuda (Nagoya Institute of Technology) NLC2008-37 SP2008-92 |
This paper describes a simultaneous conversion technique of duration and spectrum based on a statistical model including... |
NLC2008-37 SP2008-92 pp.79-84 |
SP, NLC |
2008-12-10 10:45 |
Tokyo |
Waseda Univ. |
Aperiodicity extraction based on linear prediction and temporal axis warping using fundamental frequency information Hideki Kawahara (Wakayama Univ.), Masanori Morise (Kwansei Univ.), Toru Takahashi (Kyoto Univ.), Hideki Banno (Meijo Univ.), Ryuichi Nisimura, Toshio Irino (Wakayama Univ.) NLC2008-38 SP2008-93 |
A reliable aperiodicity extractor is crucial for high-quality speech manipulation systems. This article proposes a new e... |
NLC2008-38 SP2008-93 pp.85-90 |
SP, NLC |
2008-12-10 11:20 |
Tokyo |
Waseda Univ. |
Mutually-Adaptive Generation of Utterances Based on Belief Shared by Human And Robots in Real World. Shinya Nakamura (UEC/NICT), Naoto Iwahashi (NICT/ATR), Takayuki Nagai (The University of Electro-Communications) NLC2008-39 SP2008-94 |
In this paper, an utterance generation method is proposed. Generated utterances are used to communicate with a user in... |
NLC2008-39 SP2008-94 pp.91-96 |
SP, NLC |
2008-12-10 11:45 |
Tokyo |
Waseda Univ. |
Controlling thought-evoking dialogue using POMDP Yasuhiro Minami, Minako Sawaki, Ryuichiro Higashinaka, Kohji Dohsaka (NTT) NLC2008-40 SP2008-95 |
We are researching thought-evoking dialogue systems where conversation agents appropriately affect users and evoke their... |
NLC2008-40 SP2008-95 pp.97-102 |
SP, NLC |
2008-12-10 12:10 |
Tokyo |
Waseda Univ. |
Speech recognition system for spoken dialogue system Toru Taniguchi, Shinya Fujie, Tetsunori Kobayashi (Waseda Univ.) NLC2008-41 SP2008-96 |
We have developed a speech recognition method for spoken dialogue systems such as communication robots. In the case of de... |
NLC2008-41 SP2008-96 pp.103-108 |
SP, NLC |
2008-12-10 13:40 |
Tokyo |
Waseda Univ. |
[Invited Talk] A New Paradigm for Speech Application System Development Tetsunori Kobayashi (Waseda Univ.) NLC2008-42 SP2008-97 |
Speech recognition application systems have been developed along the so-called one-directional development paradigm: the engi... |
NLC2008-42 SP2008-97 pp.109-114 |
SP, NLC |
2008-12-10 14:40 |
Tokyo |
Waseda Univ. |
Progress Report of SLP Spoken Document Processing Working Group Tomoyoshi Akiba (Toyohashi Univ. of Tech.), Kiyoaki Aikawa (Tokyo Univ. of Tech.), Yoshiaki Itoh (Iwate Prefectural Univ.), Tatsuya Kawahara (Kyoto Univ.), Hiroaki Nanjo (Ryukoku Univ.), Hiromitsu Nishizaki (Univ. of Yamanashi), Norihito Yasuda (NTT), Yoichi Yamashita (Ritsumeikan Univ.), Tomoko Matsui (The Institute of Statistical Mathematics), Xinhui Hu (NICT/ATR), Seiichi Nakagawa (Toyohashi Univ. of Tech.), Katunobu Itou (Hosei Univ.) NLC2008-43 SP2008-98 |
|
NLC2008-43 SP2008-98 pp.115-120 |