Mon, Dec 19 AM 10:30 - 10:35 |
|
- |
|
Mon, Dec 19 AM 10:35 - 11:50 |
(1) |
10:35-11:00 |
|
(2) |
11:00-11:25 |
|
(3) NLC |
11:25-11:50 |
Extraction of new abbreviated words using Crowdsourcing System |
Toshihiko Sakai (Kyushu Univ.), Masayuki Ashikawa (Toshiba), Sachio Hirokawa (Kyushu Univ.) |
|
11:50-13:00 |
Lunch ( 70 min. ) |
Mon, Dec 19 PM 13:00 - 14:15 |
(4) |
13:00-13:25 |
|
(5) |
13:25-13:50 |
|
(6) SP |
13:50-14:15 |
Telephone conversations retrieval using Line Detection method in the Distance Matrix Images (LD-DMI) |
Hiroyuki Nishi, Yuuki Yokobayashi, Haiyen, Yoshimasa Kimura, Toshio Kakinoki (Sojo Univ.) |
|
14:15-14:30 |
Break ( 15 min. ) |
Mon, Dec 19 PM 14:30 - 15:30 |
(7) |
14:30-15:30 |
|
|
15:30-15:45 |
Break ( 15 min. ) |
Mon, Dec 19 PM 15:45 - 17:15 |
(8) SP |
15:45-17:15 |
A study on language identification using non-negative matrix factorization as an extractor of phonotactic information |
Tsuyoshi Ogata, Kazuyuki Takagi (UEC Tokyo) |
(9) SP |
15:45-17:15 |
Phoneme Recognition based on AF-HMMs with Optimal State Configuration |
Narpendyah W. Ariwardhani, Yurie Iribe, Kouichi Katsurada, Tsuneo Nitta (Toyohashi Univ. of Tech.) |
(10) SP |
15:45-17:15 |
Concise representation of a matrix of basis functions for speech analysis and synthesis by using segmental NMF |
Cheol Lee, Kazunori Mano (Shibaura Inst. of Tech.) |
(11) SP |
15:45-17:15 |
Speaker identification using closed caption for scene retrieval in television broadcasting |
Keita Yamamuro, Katunobu Itou (Hosei Univ.) |
(12) SP |
15:45-17:15 |
Speaker Clustering Using Speaker Subspace Obtained Dynamically based on Variance of Intra-Utterance |
Yuki Ishikawa, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.) |
(13) SP |
15:45-17:15 |
Study on extraction of vocal part in music signal by using non-negative matrix algorithm |
Yuta Yasui, Hideki Banno, Fumitada Itakura (Meijo Univ.) |
(14) SP |
15:45-17:15 |
A proposal of acoustic feature related to voice quality for estimation of similarity in singing voice |
Chifumi Suzuki, Hideki Banno, Fumitada Itakura (Meijo Univ.), Masanori Morise (Ritsumeikan Univ.) |
(15) |
15:45-17:15 |
|
(16) |
15:45-17:15 |
|
(17) |
15:45-17:15 |
|
(18) |
15:45-17:15 |
|
(19) NLC |
15:45-17:15 |
[Poster Presentation]
Digital Signals Gave Birth to the Grammars and the Abstract Concepts
-- Autogenetic Multi-Stage Development of Human Vocal Communication System -- |
Kimiaki Tokumaru (System Engineer) |
Tue, Dec 20 AM 09:00 - 10:15 |
(20) SP |
09:00-09:25 |
Simultaneous application of speaker adaptation and noise mixture model estimation for noise suppression |
Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani (NTT) |
(21) SP |
09:25-09:50 |
GIF-SP: Improvement of Speech Recognition Using General and Discriminative Feature |
Satoshi Tamura, Yoji Tagami, Satoru Hayamizu (Gifu Univ.) |
(22) |
09:50-10:15 |
|
|
10:15-10:30 |
Break ( 15 min. ) |
Tue, Dec 20 AM 10:30 - 11:45 |
(23) SP |
10:30-10:55 |
Speaker Verification Using MMAP Adaptation |
Sangeeta Biswas, Johan Rohdin, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.) |
(24) SP |
10:55-11:20 |
Error Correction Using CRF for Mis-Recognition around OOV Words on Speech Recognition Result |
Ryohei Nakatani (Kobe Univ.), Naoto Iwahashi (NICT), Mikio Nakano (HRI-JP), Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) |
(25) |
11:20-11:45 |
|
|
11:45-13:00 |
Lunch ( 75 min. ) |
Tue, Dec 20 PM 13:00 - 14:00 |
(26) SP |
13:00-14:00 |
[Invited Talk]
Development of a framework for constructing spoken dialogue systems based on user-generated content |
Keiichi Tokuda (NITech) |
|
14:00-14:15 |
Break ( 15 min. ) |
Tue, Dec 20 PM 14:15 - 15:05 |
(27) SP |
14:15-14:40 |
An Open-Source Toolkit for Building Attractive Voice Interaction Systems -- MMDAgent |
Akinobu Lee, Keiichiro Oura, Keiichi Tokuda (NITech) |
(28) |
14:40-15:05 |
|
|
15:05-15:20 |
Break ( 15 min. ) |
Tue, Dec 20 PM 15:20 - 17:25 |
(29) |
15:20-15:45 |
|
(30) SP |
15:45-16:10 |
An MRHSMM-based conversational speech synthesis with controllability of paralinguistic information |
Tomohiro Nagata, Hiroki Mori (Utsunomiya Univ.), Takashi Nose (Tokyo Tech) |
(31) SP |
16:10-16:35 |
On the use of prosodic-event-based HMM in F0 generation of conversational speech |
Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) |
(32) SP |
16:35-17:00 |
A Study on Speaker Independent Style Conversion in HMM Speech Synthesis |
Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Tech) |
(33) SP |
17:00-17:25 |
A study on modeling phone duration using dynamic features for HMM-based speech synthesis |
Takashi Nose, Takao Kobayashi (Tokyo Tech) |