Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP |
2018-08-27 11:35 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
Discrimination of pharyngeal residue using swallowing sound in dysphagia diagnosis Tatsunori Uchino, Atsushi Hashizume, Masahisa Katsuno, Tomoki Toda (Nagoya Univ.) SP2018-27 |
The measurement of pharyngeal residue with X-ray fluoroscopy is widely used as a typical diagnosis method of swallowing ... [more] |
SP2018-27 pp.23-27 |
SP |
2018-08-27 15:55 |
Kyoto |
Kyoto Univ. |
Sound Event Encoder Using Onomatopoeic Representations based on End-to-End Approach Koichi Miyazaki, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2018-30 |
In this paper, we propose a sound event encoder for converting sound events into their onomatopoeic representations. The... [more] |
SP2018-30 pp.37-42 |
EA, ASJ-H |
2018-08-23 12:55 |
Miyagi |
Tohoku Gakuin Univ. |
Self-produced speech enhancement and suppression method with wearable air- and body-conductive microphones Moe Takada, Shogo Seki, Tomoki Toda (Nagoya Univ.) EA2018-29 |
This paper presents a self-produced speech enhancement and suppression method for multichannel signals recorded with bot... [more] |
EA2018-29 pp.7-12 |
PRMU, SP |
2018-06-28 15:10 |
Nagano |
|
Multimodal voice conversion using deep bottleneck features and deep canonical correlation analysis Satoshi Tamura, Kento Horio, Hajime Endo, Satoru Hayamizu (Gifu Univ.), Tomoki Toda (Nagoya Univ.) PRMU2018-24 SP2018-4 |
In this paper, we aim at improving the speech quality in voice conversion and propose a novel multi-modal voice conversi... [more] |
PRMU2018-24 SP2018-4 pp.13-18 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
An investigation of singing voice separation methods for a statistical approach to singing voice modification in music Tomoya Yamada, Shogo Seki, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) EA2017-139 SIP2017-148 SP2017-122 |
[more] |
EA2017-139 SIP2017-148 SP2017-122 pp.209-214 |
SP, ASJ-H |
2018-01-21 13:30 |
Tokyo |
The University of Tokyo |
[Invited Talk]
Impact of WaveNet on Speech Synthesis Research Tomoki Toda (Nagoya Univ./JST) SP2017-80 |
[more] |
SP2017-80 p.79 |
SP, ASJ-H |
2018-01-21 14:45 |
Tokyo |
The University of Tokyo |
An investigation of multi-speaker WaveNet vocoder Tomoki Hayashi, Kazuhiro Kobayashi, Akira Tamamori, Kazuya Takeda, Tomoki Toda (Nagoya Univ.) SP2017-81 |
In this paper, we investigate a multi-speaker WaveNet vocoder. In our previous work, we have demonstrated that our propo... [more] |
SP2017-81 pp.81-86 |
SP, ASJ-H |
2018-01-21 15:10 |
Tokyo |
The University of Tokyo |
Statistical voice conversion with WaveNet vocoder Kazuhiro Kobayashi, Tomoki Hayashi, Akira Tamamori, Tomoki Toda (Nagoya Univ.) SP2017-82 |
[more] |
SP2017-82 pp.87-92 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
Development of Speaker/Environment-Dependent Acoustic Model for Non-Audible Murmur Recognition Based on DNN Adaptation Seita Noda, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2017-56 |
In this research, we aim to improve the performance of non-audible murmur (NAM) recognition towards the development of s... [more] |
SP2017-56 pp.7-10 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
An Evaluation of Speech Waveform Modification Methods towards Improvement of Speech Intelligibility in Noisy Environment Tomohiro Takeyama, Kazuhiro Kobayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2017-57 |
In this research, in order to improve speech intelligibility for a listener under the noisy environment, we propose a te... [more] |
SP2017-57 pp.11-16 |
EA, ASJ-H |
2017-07-20 13:40 |
Hokkaido |
Hokkaido Univ. |
Explicit Event Duration-Controlled BLSTM-HSMM Hybrid Model for Polyphonic Sound Event Detection Tomoki Hayashi (Nagoya Univ.), Shinji Watanabe (MERL), Tomoki Toda (Nagoya Univ.), Takaaki Hori, JonathanLe Roux (MERL), Kazuya Takeda (Nagoya Univ.) EA2017-2 |
This paper presents a new BLSTM-HSMM hybrid approach for polyphonic Sound Event Detection (SED). It builds upon a state-... [more] |
EA2017-2 pp.9-14 |
SP, SIP, EA |
2017-03-01 09:20 |
Okinawa |
Okinawa Industry Support Center |
Speech waveform synthesis based on WaveNet considering speech generation process Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2016-82 SIP2016-137 SP2016-77 |
Our aim is to realize a new vocoder, which can resolve various constraints imposed on source-filter model and deal with ... [more] |
EA2016-82 SIP2016-137 SP2016-77 pp.1-6 |
SP, SIP, EA |
2017-03-01 09:45 |
Okinawa |
Okinawa Industry Support Center |
Nonaudible murmur enhancement based on non-negative tensor factorization with segment feature regularization in noisy environments Yusuke Tajiri (Nagoya Univ.), Hirokazu Kameoka (NTT), Tomoki Toda (Nagoya Univ.) EA2016-83 SIP2016-138 SP2016-78 |
Towards the development of silent speech communication, there has been studied a statistical approach to enhancing nonau... [more] |
EA2016-83 SIP2016-138 SP2016-78 pp.7-12 |
SP, SIP, EA |
2017-03-01 10:50 |
Okinawa |
Okinawa Industry Support Center |
Missing Component Restoration for Speech Spectrogram Based on Time-domain Signal Estimation Shogo Seki (Nagoya Univ.), Hirokazu Kameoka (NTT), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2016-85 SIP2016-140 SP2016-80 |
This study proposes a missing component restoration method for time-frequency masked speech spectrogram based on time-do... [more] |
EA2016-85 SIP2016-140 SP2016-80 pp.19-24 |
SP, SIP, EA |
2017-03-02 09:00 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Acoustic-to-articulatory inversion mapping with variational latent trajectory Gaussian mixture model Patrick Lumban Tobing (Nagoya Univ.), Hirokazu Kameoka (NTT), Tomoki Toda (Nagoya Univ.) EA2016-134 SIP2016-189 SP2016-129 |
[more] |
EA2016-134 SIP2016-189 SP2016-129 pp.291-296 |
SP |
2016-08-24 13:00 |
Kyoto |
ACCMS, Kyoto Univ. |
Adaptation Methods for Daily Activity Recognition Based on Deep Neural Network Tomoki Hayashi (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2016-27 |
Our objective is to build a monitoring system which enables elderly people to live actively, and the key technology to a... [more] |
SP2016-27 pp.1-6 |
SP |
2016-08-24 13:25 |
Kyoto |
ACCMS, Kyoto Univ. |
Daily Activity Recognition Based on Recurrent Neural Network Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2016-28 |
Our goal is to build an automatic surveillance system for elderly people and the core technique is daily activity recogn... [more] |
SP2016-28 pp.7-12 |
EA, SP, SIP |
2016-03-28 09:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
Method of presenting sound signals for selective listening system Tomomi Suzuki (Nagoya Univ.), Takanori Nishino (Mie Univ.), Yoshio Ishiguro, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2015-66 SIP2015-115 SP2015-94 |
We are able to selectively concentrate on the desired acoustic information in the environmental sounds.
The aim of this... [more] |
EA2015-66 SIP2015-115 SP2015-94 pp.1-6 |
EA, SP, SIP |
2016-03-28 13:15 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
An evaluation of F0 transformation for statistical singing voice conversion based on spectral differential filtering Kazuhiro Kobayashi (NAIST), Tomoki Toda (Nagoya Univ./NAIST), Satoshi Nakamura (NAIST) EA2015-84 SIP2015-133 SP2015-112 |
In this report, we propose a technique for cross-gender statistical singing voice conversion (SVC) with direct waveform ... [more] |
EA2015-84 SIP2015-133 SP2015-112 pp.105-110 |
EA, SP, SIP |
2016-03-28 13:15 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
Nonaudible murmur enhancement based on non-negative tensor factorization of air- and body-conducted signals in real environments Yusuke Tajiri (NAIST), Hirokazu Kameoka (NTT), Tomoki Toda (Nagoya Univ./NAIST), Satoshi Nakamura (NAIST) EA2015-86 SIP2015-135 SP2015-114 |
Nonaudible murmur (NAM) recorded with a special body-conductive microphone called NAM microphone is one of the promising... [more] |
EA2015-86 SIP2015-135 SP2015-114 pp.117-122 |