Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP |
2012-06-14 11:00 |
Kanagawa |
NTT Atsugi R&D Center |
A Study on Automatic Prosodic Context Labeling for Emphatic Speech Synthesis Yu Maeno, Takashi Nose, Takao Kobayashi (Tokyo Tech), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) SP2012-33 |
This paper describes automatic prosodic context labeling of training data for synthesizing expressive speech in HMM-base... [more] |
SP2012-33 pp.1-6 |
SP |
2012-06-14 11:30 |
Kanagawa |
NTT Atsugi R&D Center |
Eigenvoice-based character conversion and its evaluation Teeraphon Pongkittiphan, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2012-34 |
This paper describes a new method of voice conversion,
which aims at character conversion based on eigenvoice
GMM (EV-... [more] |
SP2012-34 pp.7-12 |
SP |
2012-06-14 13:00 |
Kanagawa |
NTT Atsugi R&D Center |
A low-cost concatenative TTS for monosyllabic languages Trung-Nghia Phung (JAIST), Mai Chi Luong (IOIT), Masato Akagi (JAIST) SP2012-35 |
Comparing with several TTSs that have been proposed, concatenative TTS has the greatest naturalness. However,
it has a ... [more] |
SP2012-35 pp.13-18 |
SP |
2012-06-14 13:30 |
Kanagawa |
NTT Atsugi R&D Center |
Comparison of Methods for Emotion Dimensions Estimation in Speech Using a Three-Layered Model Reda Elbarougy, Masato Akagi (JAIST) SP2012-36 |
This paper proposes a three-layer model for estimating the expressed emotions in a
speech signal based on a dimensional... [more] |
SP2012-36 pp.19-24 |
SP |
2012-06-14 14:15 |
Kanagawa |
NTT Atsugi R&D Center |
[Invited Talk]
Perception and communication from the viewpoint of the motor theory Makio Kashino (NTT) SP2012-37 |
[more] |
SP2012-37 p.25 |
SP |
2012-06-14 15:30 |
Kanagawa |
NTT Atsugi R&D Center |
Personalization of the physiological articulatory model using MR image based deformation Nana Nishimura, Shin'ichi Kawamoto (JAIST), Jianwu Dang (JAIST/Tianjin Univ.) SP2012-38 |
A physiological articulatory model enables to investigate individual differences for any speaker in speech production by... [more] |
SP2012-38 pp.27-32 |
SP |
2012-06-14 16:00 |
Kanagawa |
NTT Atsugi R&D Center |
Perceptual evaluation of synthesized speech reflecting "personalities" Minoru Tsuzaki (KCUA), Keiichi Tokuda (NITEC), Hisashi Kawai (KDDI R&D Labs), Yoshinori Shiga, Jinfu Ni (NICT), Keiichiro Oura, Sayaka Shiota (NITEC) SP2012-39 |
Perceptual evaluation tests were performed for talker selection methods in the application of the speaker adaptation fra... [more] |
SP2012-39 pp.33-38 |
SP |
2012-06-14 16:30 |
Kanagawa |
NTT Atsugi R&D Center |
Speaker size discrimination and vowel identification for aoustically scaled vowels
-- Dependence of vowel duration -- Cihiro Takeshima (Oberlin Univ.), Minoru Tsuzaki (KCUA), Toshio Irino (Wakayama Univ.) SP2012-40 |
[more] |
SP2012-40 pp.39-44 |
SP |
2012-06-15 10:30 |
Kanagawa |
NTT Atsugi R&D Center |
The effects of the combination of respiratory rate and acoustic tempo on the autonomic nervous system Ken Watanabe (Tokyo Inst. of Tech.), Yuki Ooishi (NTT), Makio Kashino (NTT/Tokyo Inst. of Tech./JST) SP2012-41 |
Many studies have investigated the effects of music on human activity and revealed the influences of music on the autono... [more] |
SP2012-41 pp.45-49 |
SP |
2012-06-15 11:00 |
Kanagawa |
NTT Atsugi R&D Center |
Analysis of Tempo Control Characteristics using Tapping Manami Haruki, Kiyoaki Aikawa (Tokyo Univ. of Tech.) SP2012-42 |
This report analyzed the characteristics of controlling changing tempo when human listen to the precedent tone sequence ... [more] |
SP2012-42 pp.51-55 |
SP |
2012-06-15 11:30 |
Kanagawa |
NTT Atsugi R&D Center |
A Japanese version of an audio-visual corpus for Coordinate Response Measure (CRM) Shigeto Furukawa (NTT), Shogo Kominato (Toyohashi Univ. Tech.) SP2012-43 |
The Coordinate Response Measure (CRM) is widely used in English-speaking countries as a test to measure speech intelligi... [more] |
SP2012-43 pp.57-60 |
SP |
2012-06-15 13:00 |
Kanagawa |
NTT Atsugi R&D Center |
[Invited Talk]
Mirror neuron as a link between motor control and cognitive function Akira Murata (Kinki Univ.) SP2012-44 |
The idea of functional interaction between motor system and perception system has been known from early decade of the pr... [more] |
SP2012-44 pp.61-64 |
SP |
2012-06-15 14:15 |
Kanagawa |
NTT Atsugi R&D Center |
[Invited Talk]
Measurement of Brain Activation during Speech Recognition Using Optical Topography Hirokazu Atsumori, Yukiko Hirabayashi, Atsushi Maki (Hitachi), Hideaki Sakata (Mejiro Univ.), Hiroki Sato (Hitachi) SP2012-45 |
In this article, we show Optical Topography (OT) technique for noninvasive imaging of human brain functions. Next, we pr... [more] |
SP2012-45 pp.65-68 |
SP |
2012-06-15 15:45 |
Kanagawa |
NTT Atsugi R&D Center |
Analysis of the correlation between various acoustic features and the audibility of speech with noise Hosana Kamiyama, Yusuke Ijima, Mitsuaki Isogai, Hideyuki Mizuno (NTT) SP2012-46 |
This paper addresses the correlation analysis of acoustic features with the audibility of naturally uttered speech with ... [more] |
SP2012-46 pp.69-74 |
SP |
2012-06-15 16:15 |
Kanagawa |
NTT Atsugi R&D Center |
Correlation between otoacoustic emissions and temporal fine-structure processing. Sho Otsuka, Koichi Hirota (The Univ. of Tokyo), Makio Kashino (NTT) SP2012-47 |
Recent studies showed that there was a significant correlation between the performance of selective attention task and t... [more] |
SP2012-47 pp.75-79 |
SP |
2012-06-15 16:45 |
Kanagawa |
NTT Atsugi R&D Center |
Does Recovering Sound Sources from Embedded Repetition Require Directed Attention? Keiko Masutomi (TiTech/NTT), Nicolas Barascud, Tobias Overath (UCL), Makio Kashino (NTT/TiTech), Josh H. McDermott (New York Univ.), Maria Chait (UCL) SP2012-48 |
It is known that the auditory system can recover sound sources from mixtures by detecting repeating structure embedded i... [more] |
SP2012-48 pp.81-84 |
SP |
2012-06-15 17:15 |
Kanagawa |
NTT Atsugi R&D Center |
Recalibration of tactile distance induced by auditory feedback Norimichi Kitagawa (NTT), Ana Tajadura-Jimenez (Royal Holloway, Univ. of London), Aleksander Valjamae (Univ. of Graz), Iwaki Toshima, Toshitaka Kimura (NTT), Manos Tsakiris (Royal Holloway, Univ. of London) SP2012-49 |
[more] |
SP2012-49 pp.85-88 |