Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP |
2008-03-20 10:00 |
Tokyo |
Univ. Tokyo |
[Invited Talk]
Voice-based Information Retrieval
-- How far are we from the text-based retrieval? -- Lin-shan Lee (Nat'l Taiwan Univ.) SP2007-185 |
With the rapid increase of multimedia content on the Internet, it becomes more and more important to retrieve multimedia... |
SP2007-185 pp.1-2 |
SP |
2008-03-20 11:00 |
Tokyo |
Univ. Tokyo |
[Invited Talk]
Robust Speech Recognition Based on Structured Modeling, Irrelevant Variability Normalization and Unsupervised Online Adaptation Qiang Huo (Microsoft Research Asia) SP2007-186 |
In the past several years, we've been studying several approaches to robust automatic speech recognition (ASR) based on ... |
SP2007-186 pp.3-4 |
SP |
2008-03-20 13:00 |
Tokyo |
Univ. Tokyo |
[Invited Talk]
Feature Vector Classification based on Likelihood Ratio Jeong-sik Park, Sang-min Yoon, Yung-Hwan Oh (KAIST) SP2007-187 |
This paper proposes a new feature vector classification method to reduce the recognition error due to overlap data in sp... |
SP2007-187 pp.5-10 |
SP |
2008-03-20 14:00 |
Tokyo |
Univ. Tokyo |
[Invited Talk]
Remaking speech revisited STRAIGHT and TANDEM-STRAIGHT and Their implications Hideki Kawahara (Wakayama Univ.) SP2007-188 |
Speech is a highly flexible media for communication and conveys rich information on linguistic, para-linguistic, and non... |
SP2007-188 pp.11-16 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
A STUDY ON THE LP-BASED BLIND MODEL IN RESTORING BONE-CONDUCTED SPEECH Thang tat Vu, Masashi Unoki, Masato Akagi (JAIST) SP2007-189 |
Due to the stability against the external noise, bone-conducted (BC) speech seems better to be used instead of noisy air... |
SP2007-189 pp.17-22 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Investigation of usual and unusual articulation based on simulations and observations Akikazu Nishikido, Jianwu Dang (JAIST) SP2007-190 |
Inverse estimation of articulatory movement from speech signal faces a one-to-many problem. To reduce the one-to-many pr... |
SP2007-190 pp.23-28 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Perception of Speaker Identity and Its Relation to the Phonological Features Kanae Amino, Takayuki Arai (Sophia Univ.) SP2007-191 |
In perceptual speaker identification, it is known that the identification accuracy depends on the contents of the stimul... |
SP2007-191 pp.29-34 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Effects of repetition priming and word familiarity on perception of spectrally degraded spoken words Mika Murao, Yasunari Sasaki, Hiroshi Riquimaroux (Doshisha Univ.) SP2007-192 |
Priming effects in the auditory domain have been demonstrated by several studies. Our previous studies examined effects ... |
SP2007-192 pp.35-38 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Prosody Reconstruction by Rescaling Fundamental Frequency Contours in Order to Synthesize Communicative Speech Jinfu Ni, Shinsuke Sakai, Satoshi Nakamura (NICT/ATR) SP2007-193 |
This paper presents a method of prosody reconstruction that can be used to synthesize conversational speech. In our meth... |
SP2007-193 pp.39-44 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
A Context Clustering Technique for Improvement of Tone Intelligibility of Average-voice-based Thai Speech Synthesis Suphattharachai Chomphan, Takao Kobayashi (Tokyo Inst. of Tech.) SP2007-194 |
This paper describes a novel approach to the context clustering process in a speaker independent HMM-based Thai speech s... |
SP2007-194 pp.45-50 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Robust noise suppression algorithm using the only Kalman filter theory for white and colored noises Nari Tanabe (Tokyo Univ. of Science, Suwa), Toshihiro Furukawa (Tokyo Univ. of Science), Shigeo Tsujii (Inst. of Information Security) SP2007-195 |
This paper presents a noise suppression algorithm using only the Kalman filter theory with canonical state space models:... |
SP2007-195 pp.51-56 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments Atsushi Haniu, Masashi Unoki, Masato Akagi (JAIST) SP2007-196 |
This paper proposes the concept of our novel robust speech recognition method based on the selective sound segregation m... |
SP2007-196 pp.57-62 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Robust Distant Speech Recognition by Combining Variable-term spectrum Based Position-dependent CMN with Conventional CMN Longbiao Wang, Seiichi Nakagawa (Toyohashi Univ. of Tech.), Norihide Kitaoka (Nagoya Univ.) SP2007-197 |
In a distant-talking environment, the duration of channel impulse response is longer than the short-term spectral analys... |
SP2007-197 pp.63-68 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Unsupervised Phoneme Segmentation Using Mahalanobis Distance Yu Qiao, Nobuaki Minematsu (Univ. of Tokyo) SP2007-198 |
One of the fundamental problems in speech engineering is phoneme segmentation. Approaches to phoneme segmentation can be... |
SP2007-198 pp.69-74 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
A Fast Sequence Kernel for Sequential Data Classification Makoto Yamada (SOKENDAI/Yamaha Corp.), Tomoko Matsui (Inst. of Statistical Mathematics) SP2007-199 |
Sequence kernel based speaker verification systems have been shown to outperform the conventional systems based on Gauss... |
SP2007-199 pp.75-80 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
LVCSR based on Context-Dependent Syllable Acoustic Models Jian Zhang, Longbiao Wang, Seiichi Nakagawa (Toyohashi Univ. of Tech.) SP2007-200 |
We propose an effective and accurate inter-word context dependent modeling for large vocabulary continuous speech recogn... |
SP2007-200 pp.81-86 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Accent Analysis for Mandarin Large Vocabulary Continuous Speech Recognition Dong Yang, Koji Iwano, Sadaoki Furui (Tokyo Inst. of Tech.) SP2007-201 |
This paper presents our work on accent issues in Mandarin large vocabulary continuous speech recognition. What we want t... |
SP2007-201 pp.87-92 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Initial Evaluation of the Drivers' Japanese Speech Corpus in a Car Environment Kousuke Hiraki, Takahiro Shinozaki, Koji Iwano, Agnieszka Betkowska, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.) SP2007-202 |
Car navigation systems are getting more and more popular and many of them equip a speech recognition system as hands-fre... |
SP2007-202 pp.93-98 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Efficient Discriminative Training of Error Corrective Models Using High-WER Competitors Takanobu Oba, Takaaki Hori, Atsushi Nakamura (NTT) SP2007-203 |
We focus on error corrective models for speech recognition, which select a more accurate word sequence in a word N-best ... |
SP2007-203 pp.99-104 |
SP |
2008-03-21 10:00 |
Tokyo |
Univ. Tokyo |
Vowel Epenthesis in Consonant Clusters by Japanese Speakers Masako Fujimoto (NIJLA), Seiya Funatsu (The Prefectural University of Hiroshima) SP2007-204 |
Vowel epenthesis in non-native consonant clusters by native Japanese speakers was investigated from productive point of ... |
SP2007-204 pp.105-110 |