Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Speaker adaptation using speaker-normalized DNN based on speaker codes Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-118 |
Recently, deep neural network (DNN) becomes one of the main streams of acoustic modeling for automatic speech recognitio... |
SP2014-118 pp.105-110 |
EA |
2014-10-24 14:20 |
Tokyo |
Central Research Laboratory, Hitachi, Ltd. |
[Invited Talk]
Speech enhancement techniques in multi-speaker spontaneous speech recognition for conversation scene analysis Shoko Araki, Takaaki Hori, Tomohiro Nakatani (NTT) EA2014-25 |
This paper illustrates speech enhancement techniques for multi-speaker distant-talk speech recognition, where a conversa... |
EA2014-25 pp.9-14 |
SP, IPSJ-SLP (Joint) |
2014-07-26 10:30 |
Iwate |
Hotel Hanamaki |
Dialogue group detection and speaker diarization for multi-party conversational speech recorded by multiple smartphones Koji Iwano, Toko Iizuka, Kanoko Saito, Shuhei Yoneyama (Tokyo City Univ.) SP2014-71 |
This paper describes acoustic information processing techniques for multi-party conversational speech recorded by attend... |
SP2014-71 pp.47-52 |
SP |
2014-02-28 12:40 |
Tokushima |
The University of Tokushima |
[Poster Presentation]
Speech style manipulation using entrainment for speech recognition system Koutaro Sugiyama, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura (NAIST) SP2013-114 |
Achievement of highly accurate speech recognition is essential to develop well-behaved spoken dialogue systems as speech... |
SP2013-114 pp.27-28 |
SP, IPSJ-SLP |
2012-12-21 14:40 |
Tokyo |
TITECH(Ookayama) |
Reduction of cross spectrum for feature-domain sound source separation Atsushi Ando (Nagoya Univ.), Kenta Niwa (NTT), Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) SP2012-93 |
Speech source separation is utilized for recognition of simultaneous speech. Conventional source separation methods, esp... |
SP2012-93 pp.107-112 |
SP |
2012-06-15 14:15 |
Kanagawa |
NTT Atsugi R&D Center |
[Invited Talk]
Measurement of Brain Activation during Speech Recognition Using Optical Topography Hirokazu Atsumori, Yukiko Hirabayashi, Atsushi Maki (Hitachi), Hideaki Sakata (Mejiro Univ.), Hiroki Sato (Hitachi) SP2012-45 |
In this article, we show Optical Topography (OT) technique for noninvasive imaging of human brain functions. Next, we pr... |
SP2012-45 pp.65-68 |
PRMU, SP |
2012-02-09 15:15 |
Miyagi |
|
Electrooculogram recognition using hidden Markov model Fuming Fang, Takahiro Shinozaki, Yasuo Horiuchi, Shingo Kuroiwa (Chiba Univ), Sadaoki Furui (Tokyo Tech), Toshimitsu Musha (BFL) PRMU2011-202 SP2011-117 |
In order to provide an efficient means of communication for those who cannot move muscles of their whole body except eye... |
PRMU2011-202 SP2011-117 pp.97-102 |
PRMU, SP |
2012-02-10 15:50 |
Miyagi |
|
Two-pass Approach for Recognizing Code-Switching Speech Yonatan Andy Fajar Nugraha, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.) PRMU2011-235 SP2011-150 |
We propose a recognition method for code-switching utterances in multilingual speech recognition. Code-switching is the ... |
PRMU2011-235 SP2011-150 pp.225-229 |
VLD, DC, IPSJ-SLDM, CPSY, RECONF, ICD, CPM (Joint) |
2011-11-30 10:05 |
Miyazaki |
NewWelCity Miyazaki |
A 40nm 144mW VLSI Processor for Realtime 60k Word Continuous Speech Recognition Takanobu Sugahara, Guangji He, Tsuyoshi Fujinaga, Yuki Miyamoto, Hiroki Noguchi, Shintaro Izumi, Hiroshi Kawaguchi, Masahiko Yoshimoto (Kobe Univ.) CPM2011-164 ICD2011-96 |
We have developed a low power VLSI chip for 60k-word real-time continuous speech recognition based on HMM(Hidden Markov ... |
CPM2011-164 ICD2011-96 pp.79-84 |
SIS, IPSJ-AVM |
2011-09-22 13:40 |
Akita |
|
Robust Keyword Spotting using The Isolated Word Recognition Wataru Takahashi, Yoshikazu Miyanaga (Hokkaido Univ.) SIS2011-37 |
In this report, we performed a keyword spotting from continuous speech under noisy environments. The proposed algorithm i... |
SIS2011-37 pp.105-109 |
SP |
2011-06-23 14:15 |
Aichi |
Nagoya Univ. |
[Invited Talk]
Recent topics in acoustic modeling for speech recognition
-- A machine learning perspective based on generative and discriminative approaches -- Shinji Watanabe (NTT) SP2011-31 |
Acoustic model research is one of the most active area in speech recognition, and there have been various progress. This... |
SP2011-31 pp.7-10 |
EA, SIP, SP |
2011-05-13 13:50 |
Osaka |
Ritsumeikan Univ. |
Estimation of Head Orientation Based on Discrimination of Acoustic Transfer Functions Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) EA2011-29 SIP2011-29 SP2011-29 |
This paper presents a talker's head orientation estimation method using only a single microphone, where phoneme HMMs (Hi... |
EA2011-29 SIP2011-29 SP2011-29 pp.167-172 |
SP |
2011-01-27 15:45 |
Kyoto |
NICT |
Relation between musical noise generation in nonlinear signal processing and speech recognition performance Ryoichi Miyazaki, Takayuki Inoue, Nobuhisa Hirata, Hiroshi Saruwatari, Kiyohiro Shikano (NAIST), Tomoya Takatani (TOYOTA) SP2010-106 |
In this paper, we discuss a relation between musical noise generation in nonlinear signal processing and speech recogni... |
SP2010-106 pp.19-24 |
NLC, SP (Joint) |
2010-12-21 11:05 |
Tokyo |
National Olympics Memorial Youth Center |
Evaluation of Successive Rapid Hypothesis Determination Algorithm for Continuous Word Recognition Hiroyuki Ohno (Nagoya Inst. of Tech.), Hiroshi Kojima (Nagoya Inst. of Tech/Hitachi Solutions, Ltd.), Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2010-21 SP2010-94 |
Minimizing response delay of speech recognition system and giving rapid feed backs are important properties for an intui... |
NLC2010-21 SP2010-94 pp.77-82 |
NLC, SP (Joint) |
2010-12-21 11:30 |
Tokyo |
National Olympics Memorial Youth Center |
Voice Recognition Method Based on Features of Consonant Using Pipelined Matching Circuit Jun Nonaka, Keita Okamoto, Hakaru Tamukoh, Masatoshi Sekine (Tokyo Univ. of Agr and Tech.) NLC2010-22 SP2010-95 |
The human audition extracts voice characteristics from the sounds flowing into two ears. The recognition circuit of the ... |
NLC2010-22 SP2010-95 pp.83-88 |
EA |
2010-11-18 13:30 |
Fukuoka |
Kyushu Univ. |
Basic Study of the Speech Recognition Using the Change the Area of the Lips Yuki Ogawa, Yoichi Midorikawa, Masanori Akita (Oita Univ) EA2010-85 |
In speech recognition, the decrease at the recognition rate is a problem in the noisy environment. The research that use... |
EA2010-85 pp.7-12 |
SP, EA, SIP |
2010-05-26 13:25 |
Hyogo |
Konan Univ. (Hirao Seminar House) |
Online Speech Separation based on Spectral Subtraction for Meeting Speech Recognition Yu Nasu, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.) EA2010-2 SIP2010-2 SP2010-2 |
This paper proposes a speech separation method for meeting speech recognition, which operates in real time. The proposed... |
EA2010-2 SIP2010-2 SP2010-2 pp.7-12 |
PRMU, SP, MVE, CQ |
2010-01-22 11:30 |
Kyoto |
Kyoto Univ. |
Consonant recognition method based on pipelined matching circuit Jun Nonaka, Hiroshi Kotaki, Hakaru Tamukoh, Masatoshi Sekine (Tokyo Univ. of Agri and Tech.) CQ2009-99 PRMU2009-198 SP2009-139 MVE2009-121 |
The human audition can extract amount of characteristic and the source of sound which flow into two ears. Then freque... |
CQ2009-99 PRMU2009-198 SP2009-139 MVE2009-121 pp.291-296 |
SIS |
2009-12-17 16:00 |
Kyoto |
|
[Fellow Memorial Lecture]
On a System with High-Speed Processing and Low-Power Consumption for Speech Signal Processing Yoshikazu Miyanaga (Hokkaido Univ.) SIS2009-42 |
In this report, a new noise robust speech recognition system is introduced. As a narrow sense of speech recognition, a ... |
SIS2009-42 pp.67-72 |
SP |
2009-07-18 09:50 |
Fukushima |
|
Speaking Style Classification of Spontaneous Speech Using Multiple-Regression HMM Takashi Nose, Takeshi Matsubara, Yusuke Ijima, Takao Kobayashi (Tokyo Inst. of Tech.) SP2009-46 |
This paper describes speaking style classification and speech recognition for spontaneous speech based on multiple-regre... |
SP2009-46 pp.31-36 |