Thu, Nov 13 PM 13:45 - 15:00 |
(1) |
13:45-14:10 |
A comparative study of paralinguistic information control methods for HMM-based dialogue speech synthesis |
Hiroki Mori, Shunsuke Takahashi, Tomohiro Nagata (Utsunomiya Univ.) |
(2) |
14:10-14:35 |
Variable factor of laughter in an utterance for dialogue speech synthesis |
Tomohiro Nagata, Hiroki Mori (Utsunomiya Univ.) |
(3) |
14:35-15:00 |
Shared emotion additive model for HMM-based emotional speech synthesis |
Yamato Ohtani, Yu Nasu, Ryo Morinaka, Masatsune Tamura, Masahiro Morita, Masami Akamine (Toshiba) |
|
- |
|
|
15:00-15:05 |
Break ( 5 min. ) |
Thu, Nov 13 PM 15:15 - 16:15 |
(4) |
15:15-16:15 |
[Invited Talk]
Speech Synthesis for Conversation System |
Tetsunori Kobayashi, Kazuhiko Iwata (Waseda Univ.) |
|
16:15-16:30 |
Break ( 15 min. ) |
Thu, Nov 13 PM 16:30 - 17:45 |
(5) |
16:30-16:55 |
A study on intuitive control of emotional expressions and speaking styles using facial features by Kinect |
Yu Bi, Takashi Nose, Akinori Ito (Tohoku Univ.) |
(6) |
16:55-17:20 |
Emphasized Accent Phrase Prediction from Advertisement Text towards Expressive Text-to-speech Synthesis |
Hideharu Nakajima, Hideyuki Mizuno, Sumitaka Sakauchi (NTT) |
(7) |
17:20-17:45 |
Emotional speech synthesis for long words by generalizing accent types |
Yuko Aoyama (hakase.com), Tsuyoshi Moriyama (Tokyo Polytechnic Univ.) |
|
- |
|
Fri, Nov 14 AM 09:00 - 10:15 |
(8) |
09:00-09:25 |
Influence of the phase of voiced sound in source-filter speech synthesis on nerve cell responses in the auditory cortex
-- A study based on nerve cell responses in the primary auditory cortex of an awake cat -- |
Masanori Morise, Kaito Okubo, Sohei Chimoto, Yu Sato, Kenji Ozawa (Univ. of Yamanashi) |
(9) |
09:25-09:50 |
A study for estimating the vocal-tract shape from speech spectrum using a sensitivity function |
Tokihiko Kaburagi (Kyushu Univ.) |
(10) |
09:50-10:15 |
Application of a vocal tract mapping interface to inverse estimation of vocal tract shapes |
Kohichi Ogata, Tayuto Kodama, Tomohiro Hayakawa (Kumamoto Univ.) |
|
- |
|
|
10:15-10:20 |
Break ( 5 min. ) |
Fri, Nov 14 AM 10:30 - 11:45 |
(11) |
10:30-10:55 |
A study of speaker normalization based on voice conversion for statistical acoustic-to-articulatory mapping |
Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) |
(12) |
10:55-11:20 |
Design of control parameters for voice quality control based on multiple-regression Gaussian mixture model |
Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) |
(13) |
11:20-11:45 |
An evaluation of target speech for nonaudible murmur enhancement focusing on its intelligibility under noisy environments |
Sakura Tsuruta, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) |
|
- |
|
|
11:45-12:50 |
Break ( 65 min. ) |
Fri, Nov 14 PM 13:00 - 14:15 |
(14) |
13:00-13:25 |
Analysis of color attributes derived from vowel sound impression
-- for multimodal expression of sentiment information -- |
Kanako Watanabe, Yoko Greenberg, Yoshinori Sagisaka (Waseda Univ.) |
(15) |
13:25-13:50 |
An Acoustical Analysis of Singing Voice of Kugaki and Ufugaki in Ryukyuan Classical Music "Nomura Style" |
Yasufumi Uezu, Tokihiko Kaburagi (Kyushu Univ.) |
(16) |
13:50-14:15 |
A method of measuring articulatory space using NDI Wave speech research system |
Tatsuya Kitamura (Konan Univ.), Yukiko Nota (ATR-Promotions), Michiko Hashi (Pref. Univ. Hiroshima), Hiroaki Hatano (ATR/Kobe Univ.) |
|
- |
|