Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 09:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Recognition and Analysis of Emotion in Indonesian Conversational Speech Nurul Lubis, Sakriani Sakti, Graham Neubig, Tomoki Toda (NAIST), Dessi Lestari, Ayu Purwarianti (ITB), Satoshi Nakamura (NAIST) SP2014-106 |
The importance of incorporating emotional aspect in human computer interaction continues to arise. Unfortunately, explor... [more] |
SP2014-106 pp.1-6 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 10:45 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Investigation of Deep Neural Network and Cross-adaptation for Voice Activity Detection in Meeting Speech Akihiro Nakadani (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.) SP2014-107 |
In voice activity detection(VAD), performance largely decreases under the influence of noise and reverberation. In this ... [more] |
SP2014-107 pp.19-24 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 13:10 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Invited Talk]
Dialogue state tracking in statistical dialogue management Kai Yu, Lu Chen (SJTU) SP2014-108 |
(Advance abstract in Japanese is available) [more] |
SP2014-108 pp.25-29 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 14:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Invited Talk]
Statistical approach to flexible speech synthesis
-- towards human-like talking machines -- Keiichi Tokuda (NITech/Google) SP2014-109 |
This talk will give an overview of statistical approach to
flexible speech synthesis. For constructing human-like
tal... [more] |
SP2014-109 p.31 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 19:20 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
An experimental study of definitions of reference pronunciation distances and acoustic features used for distance prediction with the aim of pronunciation clustering Shun Kasahara (Univ. of Tokyo), Tianze Shi (Tsinghua Univ.), Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (Univ. of Tokyo) SP2014-110 |
“World Englishes” indicates well one aspect of the current state of English as an international language, which claims t... [more] |
SP2014-110 pp.47-52 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Articulatory Controllable Speech Modification using Sequential Inversion and Production Mapping with Gaussian Mixture Models Patrick Lumban Tobing, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST), Ayu Purwarianti (ITB) SP2014-111 |
In this report, we propose an articulatory controllable speech modification framework using statistical inversion and pr... [more] |
SP2014-111 pp.57-62 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Prosody Correction Preserving Speaker Individuality in English-Read-By-Japanese Speech Synthesis Based on HMM Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2014-112 |
To build an English acoustic model that well captures speaker individuality of each Japanese speaker, a framework using ... [more] |
SP2014-112 pp.63-68 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Noise robust speech recognition by non-negative matrix factorization using GMM clustering in MFCC domain Kentaro Fujigaki, Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-113 |
Exemplar-based feature enhancement by non-negative matrix factorization (NMF) was proposed for noise-robust speech recog... [more] |
SP2014-113 pp.69-74 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Many-to-one Voice Conversion using Multiple Non-negative Matrix Factorization Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2014-114 |
Voice conversion (VC) is being widely researched in the field of speech processing because of increased interest in usin... [more] |
SP2014-114 pp.75-80 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
HMM-Based Speech Synthesis System with Prosody Modification Based on Speech Input Yuri Nishigaki, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2014-115 |
As a creative activity using speech synthesis technologies has been grown rapidly, it is desired to develop an interface... [more] |
SP2014-115 pp.81-86 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Multimodal Voice Conversion using Weighted Features in Noisy Environments Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2014-116 |
Voice conversion is a technique for converting specific information in speech while maintaining the other information, s... [more] |
SP2014-116 pp.87-92 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Voice conversion based on deep neural network with multiple output sub-networks Tetsuya Hashimoto, Yosuke Kashiwagi, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu (Univ. of Tokyo) SP2014-117 |
(Advance abstract in Japanese is available) [more] |
SP2014-117 pp.99-104 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Speaker adaptation using speaker-normalized DNN based on speaker codes Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-118 |
Recently, deep neural network (DNN) becomes one of the main streams of acoustic modeling for automatic speech recognitio... [more] |
SP2014-118 pp.105-110 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
Deep neural network-based feature transformation for reverberant speaker identification Zhaofeng Zhang, Longbiao Wang (NUT), Atsuhiko Kai (Shizuoka Univ.), Weifeng Li (Tsinghua Univ.), Masahiro Iwahashi (NUT) SP2014-119 |
(Advance abstract in Japanese is available) [more] |
SP2014-119 pp.111-116 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
Accent identification by conbining GMM and DNN under reverberant environment Ryota Sakagami, Longbiao Wang, Zhang Zhaofeng, Khomdet Phapatanaburi, Masahiro Iwahashi (NUT) SP2014-120 |
(Advance abstract in Japanese is available) [more] |
SP2014-120 pp.123-128 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
speech selection and environmental adaptation for asynchronous speech recording based on deep neural network Bo Ren, Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.) SP2014-121 |
In this paper, we propose a robust distant-talking speech recognition system with asynchronous speech recording. This is... [more] |
SP2014-121 pp.129-134 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
STD for SQ using MSRR Satoshi Oshima, Yoshiaki Itoh (Iwate Prefectural Univ.) SP2014-122 |
This paper describes the method about STD for SQ using MSRR. STD denotes Spoken Term Detection that is one of the most i... [more] |
SP2014-122 pp.135-140 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
A Study on Speaker Recognition Method in Consideration of Speaking Style Differences in Lecture Speech Kota Nakatsuji (Doshisha Univ.), Masafumi Nishida (Nagoya Univ.), Seiichi Yamamoto (Doshisha Univ.) SP2014-123 |
Speaker recognition technology has been applied to achieve a variety of tasks such as minute taking and speaker search f... [more] |
SP2014-123 pp.141-146 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
relationship between speakers' characteristics and the information transmission quality in Dialog Bohan Chen (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Kazuya Takeda (Nagoya Univ.) SP2014-124 |
We investigate the correlation between speakers’ characteristics similarity and their information transmission efficienc... [more] |
SP2014-124 pp.147-152 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
Automatic Language Identification Based on Posterior Probability on Articulatory Classes Takumi Hirata, Kazuyuki Takagi (UEC) SP2014-125 |
Extraction of features from input speech that are effective in distinguishing the language is a key issue for language i... [more] |
SP2014-125 pp.153-157 |