Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
Automatic estimation of prosodic control made in English utterances using DNN-based acoustic models trained with prosodic features and labels Yang Shen, Shintarou Ando, Nobuaki Minematsu, Daisuke Saito (UTokyo), Satoshi Kobashikawa (NTT) EA2019-136 SIP2019-138 SP2019-85 |
This paper investigates how to utilize DNN acoustic models trained with prosodic features and labels to detect prosodic e... |
EA2019-136 SIP2019-138 SP2019-85 pp.201-206 |
SP |
2019-08-28 14:40 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
Analysis of prosodic differences between a newscaster and amateur speakers using partial-substituted synthetic speech Takuya Ozuru (Univ. of Tokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) SP2019-11 |
This paper analyzes prosodic differences between a professional newscaster and amateur speakers that affect listeners’... |
SP2019-11 pp.13-18 |
SP |
2019-06-13 14:20 |
Kanagawa |
Tokyo Institute of Technology |
A large collection of sentences read aloud by Vietnamese learners of Japanese and native speakers' reverse shadowings Shintaro Ando, Tasavat Trisitichoke, Yusuke Inoue, Fuki Yoshizawa, Daisuke Saito, Nobuaki Minematsu (UTokyo) SP2019-3 |
The main objective of language learning is to acquire good communication skills in the target language. From that viewp... |
SP2019-3 pp.13-17 |
SP |
2019-06-13 14:45 |
Kanagawa |
Tokyo Institute of Technology |
Evaluation of Comprehensibility of L2 Speech Based on Native Listeners’ Reverse Shadowing and Their Facial Expressions Tasavat Trisitichoke, Shintaro Ando, Daisuke Saito, Nobuaki Minematsu (UTokyo) SP2019-4 |
Recently, researchers' attention has been paid to pronunciation assessment not based on comparison between L2 utterances... |
SP2019-4 pp.19-24 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
An experimental study of influence of classroom babble noise on automatic assessment of learners' shadowing speech Suguru Kabashima, Daisuke Saito, Nobuaki Minematsu (UTokyo), Yutaka Yamauchi (Soka Univ.), Kayoko Ito (Koyasan Univ.) EA2018-118 SIP2018-124 SP2018-80 |
(To be available after the conference date) |
EA2018-118 SIP2018-124 SP2018-80 pp.113-118 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Modeling learners’ pronunciation variations and its application to automatic phoneme error detection Haoyu Zhang, Daisuke Saito, Nobuaki Minematsu (UTokyo), Satoshi Kobashikawa, Ryo Masumura (NTT) EA2018-119 SIP2018-125 SP2018-81 |
|
EA2018-119 SIP2018-125 SP2018-81 pp.119-124 |
SIP, EA, SP, MI (Joint) |
2018-03-19 13:00 |
Okinawa |
|
[Poster Presentation]
Quantitative and corpus-based analysis of pronunciation diversity observed in Japanese English Suguru Kabashima, Haoyu Zhang, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo), Satoshi Kobashikawa, Ryo Masumura (NTT) EA2017-113 SIP2017-122 SP2017-96 |
In foreign language teaching, corrective feedback to learners' pronunciation is regarded as highly important and automa... |
EA2017-113 SIP2017-122 SP2017-96 pp.69-74 |
SIP, EA, SP, MI (Joint) |
2018-03-19 13:00 |
Okinawa |
|
[Poster Presentation]
An Experimental Study on Segmental and Prosodic Comparison of Utterances for Automatic Assessment of Dubbing Speech Takuya Ozuru, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) EA2017-114 SIP2017-123 SP2017-97 |
In Japanese language education, especially in its speech training, dubbing-based training has gained huge popularity.... |
EA2017-114 SIP2017-123 SP2017-97 pp.75-80 |
SP, ASJ-H |
2018-01-20 14:55 |
Tokyo |
The University of Tokyo |
[Poster Presentation]
Automatic speech quality control of English listening materials and examination of Japanese learners’ listening ability in terms of robustness Haoyu Zhang, Yusuke Inoue, Daisuke Saito, Nobuaki Minematsu (UTokyo), Yutaka Yamauchi (TIU), Hinako Masuda (SeikeiU) SP2017-71 |
When a speaker speaks and a listener listens to that speaker, extra-linguistic and environmental factors often degrade t... |
SP2017-71 pp.31-34 |
SP |
2017-01-21 14:00 |
Tokyo |
The University of Tokyo |
[Invited Talk]
Deep learning in voice conversion Daisuke Saito (UTokyo) SP2016-72 |
In this paper, deep learning techniques in voice conversion studies are overviewed. Recently, deep learning techniques w... |
SP2016-72 pp.47-52 |
EA, SP, SIP |
2016-03-29 09:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
Amplitude limiters based on phase optimization Akira Kakitani, Daisuke Saito, Yasuhiro Kosugi, Nobuaki Minematsu (UTokyo) EA2015-111 SIP2015-160 SP2015-139 |
In order to reduce the peak value of source waveforms without quality degradation, a novel method is proposed. In this m... |
EA2015-111 SIP2015-160 SP2015-139 pp.249-254 |
EA, SP, SIP |
2016-03-29 09:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
An experimental study of designing context labels for infant-directed storytelling speech synthesis Kyota Hyakutake, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2015-112 SIP2015-161 SP2015-140 |
Context labels for infant-directed storytelling speech synthesis are investigated. After collecting one-hour storytellin... |
EA2015-112 SIP2015-161 SP2015-140 pp.255-260 |
EA, SP, SIP |
2016-03-29 10:45 |
Oita |
Beppu International Convention Center B-ConPlaza |
Tensor-based Speech Representation and its Application to Identification of Languages and Speakers So Suzuki, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2015-127 SIP2015-176 SP2015-155 |
This paper proposes a novel approach to speech representation for automatic identification of languages and speakers by ... |
EA2015-127 SIP2015-176 SP2015-155 pp.341-346 |
SP |
2016-01-14 10:55 |
Kanagawa |
Sunpian Kawasaki |
A study of predicting unseen articulatory movements using speech structure Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) SP2015-86 |
|
SP2015-86 pp.7-12 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) |
2015-12-02 10:00 |
Aichi |
Nagoya Inst of Tech. |
Voice Conversion based on Projection to Speaker Space Bases constructed by Deep Neural Network Tetsuya Hashimoto, Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu (UTokyo) SP2015-70 |
(Advance abstract in Japanese is available) |
SP2015-70 pp.1-6 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) |
2015-12-03 09:50 |
Aichi |
Nagoya Inst of Tech. |
Multi-speaker speech synthesis and speaker adaptation based on deep bidirectional long short-term memory recurrent neural network Yi Zhao, Nobuaki Minematsu, Daisuke Saito (UTokyo) SP2015-82 |
(Advance abstract in Japanese is available) |
SP2015-82 pp.105-110 |
SP, IPSJ-SLP (Joint) |
2015-07-16 15:10 |
Nagano |
Katakura Suwako Hotel |
A study on discriminative approach for estimation of the divergence between distributions and its application to language identification Yosuke Kashiwagi, Congying Zhang, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) SP2015-38 |
In this paper, we propose a method for estimating the statistical divergence between probability distributions by a disc... |
SP2015-38 pp.13-18 |
WIT, SP, ASJ-H, PRMU |
2015-06-18 16:00 |
Niigata |
|
Noise-robust Prediction of Pronunciation Distances Aiming at Clustering of World Englishes Using a Learner's Self-centered Viewpoint Yuichi Sato, Yosuke Kashiwagi, Shun Kasahara, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (UT) PRMU2015-45 SP2015-14 WIT2015-14 |
In recent years, we have had more and more international tourists, and in 2020 we will have the Tokyo Olympic Games. For communicating... |
PRMU2015-45 SP2015-14 WIT2015-14 pp.77-82 |
SP |
2015-01-22 14:40 |
Gifu |
Juroku Plaza |
Automatic prediction of intelligibility of English words spoken with Japanese accents -- Comparative study of features and models used for prediction -- Teeraphon Pongkittiphan, Nobuaki Minematsu (Univ Tokyo), Takehiko Makino (Chuo Univ.), Daisuke Saito, Keikichi Hirose (Univ Tokyo) SP2014-132 |
This study investigates automatic prediction of the words in given sentences that will be unintelligible to American lis... |
SP2014-132 pp.31-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) |
2014-12-15 19:20 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
An experimental study of definitions of reference pronunciation distances and acoustic features used for distance prediction with the aim of pronunciation clustering Shun Kasahara (Univ. of Tokyo), Tianze Shi (Tsinghua Univ.), Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (Univ. of Tokyo) SP2014-110 |
“World Englishes” indicates well one aspect of the current state of English as an international language, which claims t... |
SP2014-110 pp.47-52 |