Thu, Jul 22 PM 13:30 - 14:45 |
(1) |
13:30-13:55 |
A study of relationship between speaker identification and acoustic features using perceptual similarity of imitated voice |
Mari Tanaka (Waseda Univ.), Hideki Kawahara (Wakayama Univ.), Shigeo Morishima (Waseda Univ.) |
(2) |
13:55-14:20 |
Extraction of Angry Phone Calls Using Prosody and Dialog Features |
Narichika Nomoto, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi (NTT Corp.) |
(3) |
14:20-14:45 |
Pronunciation assessment based on multilayer multiple regression analysis using structural features |
Masayuki Suzuki, Ayano Nakamura (Univ. of Tokyo.), Yu Qiao (Shenzhen Institutes), Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo.) |
Thu, Jul 22 PM 16:00 - 18:00 |
(4) |
16:00-17:00 |
[Invited Talk]
Research Activities in Music Information Retrieval |
Keiichiro Hoashi (KDDI Labs.) |
(5) |
17:00-18:00 |
[Invited Talk]
Image/Video Recognition and Retrieval |
Keiji Yanai (The Univ. of Electro-Comm.) |
Fri, Jul 23 AM 09:00 - 09:50 |
(6) |
09:00-09:25 |
Speech Recognition using Phase Information based on Long-Term Analysis |
Kazumasa Yamamoto, Eiichi Sueyoshi, Seiichi Nakagawa (Toyohashi Univ. of Tech.) |
(7) |
09:25-09:50 |
Speech recognition by using word graph combination under various noise conditions |
Shunsuke Kuramata, Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.) |
Fri, Jul 23 PM 13:50 - 14:15 |
(8) |
13:50-14:15 |
Confidence Estimation at the Spoken Document Level Using Word Contextual Coherence and Acoustic Likelihood |
Taichi Asami, Satoshi Kobashikawa, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi (NTT Corp.) |
Fri, Jul 23 PM 14:30 - 14:15 |
Sat, Jul 24 PM 15:10 - 15:35 |
(9) |
15:10-15:35 |
Spoken Dialogue Manager in Car Navigation System Using Partially Observable Markov Decision Processes with Hierarchical Reinforcement Learning |
Yasuhide Kishimoto, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) |