ken-system: - All Technical Committee Conferences

IEICE Technical Committee Submission System
Conference Schedule

Online Proceedings
[Sign in]
Tech. Rep. Archives

[Japanese] / [English]

(

Committee/Place/Topics

) --Press->

(

Paper Keywords: / Column:Title Auth. Affi. Abst. Keyword

) --Press->

All Technical Committee Conferences (Searched in: All Years)

Search Results: Conference Papers

Conference Papers (Available on Advance Programs) (Sort by: Date Descending)

Committee	Date Time	Place		Paper Title / Authors	Abstract	Paper #
SP, IPSJ-MUS, IPSJ-SLP [detail]	2023-06-24 13:50	Tokyo	(Primary: On-site, Secondary: Online)	Domain adaptation of speech recognition models based on self-supervised learning using target domain speech Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yuko Wakabayashi, Norihide Kitaoka (TUT) SP2023-19	In this study, we propose a domain adaptation method using only speech data in the target domain without using transcrib... [more]	SP2023-19 pp.91-96
ET	2023-03-14 14:10	Tokushima	Tokushima University (Primary: On-site, Secondary: Online)	HMD-type customer service training support system using eye tracking Takeru Oue, Yukihiro Matsubara, Kousuke Mouri, Masaru Okamoto (Hiroshima City Univ.) ET2022-71	In this paper, customer service training support system using HMD and eye tracking approach are developed. By using this... [more]	ET2022-71 pp.73-78
SP, IPSJ-SLP, EA, SIP [detail]	2023-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	A Study on Scheduled Sampling for Neural Transducer-based ASR Takafumi Moriya, Takanori Ashihara, Hiroshi Sato, Kohei Matsuura, Tomohiro Tanaka, Ryo Masumura (NTT) EA2022-100 SIP2022-144 SP2022-64	In this paper, we propose scheduled sampling approaches suited for the recurrent neural network-transducer (RNNT) that i... [more]	EA2022-100 SIP2022-144 SP2022-64 pp.147-152
SP, IPSJ-SLP, EA, SIP [detail]	2023-03-01 11:00	Okinawa	(Primary: On-site, Secondary: Online)	Representation and Prediction of Accent Phrase Prosodic Features in Japanese Text-to-Speech Masaki Sato, Shinnosuke Takamichi, Hiroshi Saruwatari (The Univ. of Tokyo) EA2022-108 SIP2022-152 SP2022-72	In order to use speech synthesis in a variety of situations such as dialogue systems and emotional expression in audiobo... [more]	EA2022-108 SIP2022-152 SP2022-72 pp.197-202
SP, IPSJ-SLP, EA, SIP [detail]	2023-03-01 11:00	Okinawa	(Primary: On-site, Secondary: Online)	Analysis of Noisy-target Training for DNN-based speech enhancement and investigation towards its practical use Takuya Fujimura, Tomoki Toda (Nagoya Univ.) EA2022-112 SIP2022-156 SP2022-76	Deep neural network (DNN)-based speech enhancement usually uses a clean speech as a training target. However, it is hard... [more]	EA2022-112 SIP2022-156 SP2022-76 pp.221-226
EA, US (Joint)	2022-12-22 16:50	Hiroshima	Satellite Campus Hiroshima	[Poster Presentation] Data augmentation method for machine learning on speech data Tsubasa Maruyama (Tokyo Tech), Tsutomu Ikegami (AIST), Toshio Endo (Tokyo Tech), Takahiro Hirofuchi (AIST) EA2022-68	In machine learning, data augmentation is a method to enhance the number and diversity of data by adding transformations... [more]	EA2022-68 pp.42-48
NLC, IPSJ-NL, SP, IPSJ-SLP [detail]	2022-11-30 15:30	Tokyo	(Primary: On-site, Secondary: Online)	Semi-supervised joint training of text to speech and automatic speech recognition using unpaired text data Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura (NTT) NLC2022-14 SP2022-34	This paper presents a novel joint training of text to speech (TTS) and automatic speech recognition (ASR) with small amo... [more]	NLC2022-14 SP2022-34 pp.27-32
NLC, IPSJ-NL, SP, IPSJ-SLP [detail]	2022-12-01 15:20	Tokyo	(Primary: On-site, Secondary: Online)	Domain and language adaptation of large-scale pretrained model for speech recognition of low-resource language Kak Soky (Kyoto University), Sheng Li (NICT), Chenhui Chu, Tatsuya Kawahara (Kyoto University) NLC2022-17 SP2022-37	The self-supervised learning (SSL) models are effective for automatic speech recognition (ASR). Due to the huge paramete... [more]	NLC2022-17 SP2022-37 pp.45-49
NLC, IPSJ-NL, SP, IPSJ-SLP [detail]	2022-12-01 15:50	Tokyo	(Primary: On-site, Secondary: Online)	ASR model adaptation to target domain with large-scale audio data without transcription Takahiro Kinouchi, Daiki Mori (TUT), Ogawa Atsunori (NTT), Norihide Kitaoka (TUT) NLC2022-18 SP2022-38	Nowadays, speech recognition is used in various services and businesses thanks to the advent of high-performance models ... [more]	NLC2022-18 SP2022-38 pp.50-53
SIP	2022-08-26 14:08	Okinawa	Nobumoto Ohama Memorial Hall (Ishigaki Island) (Primary: On-site, Secondary: Online)	Study on Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients Quoc-Huy Nguyen, Masashi Unoki (JAIST) SIP2022-71	Bone-conducted (BC) speech potentially avoids the undesired effects on recorded speech due to background noise or reverb... [more]	SIP2022-71 pp.109-114
SP, IPSJ-MUS, IPSJ-SLP [detail]	2022-06-17 15:00	Online	Online	Representation and analytical normalization for vocal-tract-length transformation by group theory Atsushi Miyashita, Tomoki Toda (Nagoya Univ) SP2022-11	In automatic speech recognition, a recognition result should be invariant with respect to acoustic changes caused by dif... [more]	SP2022-11 pp.41-46
SP, IPSJ-MUS, IPSJ-SLP [detail]	2022-06-18 15:00	Online	Online	Unsupervised Training of Sequential Neural Beamformer Using Blindly-separated and Non-separated Signals Kohei Saijo, Tetsuji Ogawa (Waseda Univ.) SP2022-25	We present an unsupervised training method of the sequential neural beamformer (Seq-NBF) using the separated signals fro... [more]	SP2022-25 pp.110-115
EA, SIP, SP, IPSJ-SLP [detail]	2022-03-01 12:45	Okinawa	(Primary: On-site, Secondary: Online)	Incorporating Acoustic and Textual Information for Language Modeling in Code-switching Speech Recognition Roland Hartanto, Kuniaki Uto, Koichi Shinoda (TokyoTech) EA2021-73 SIP2021-100 SP2021-58	People who speak two or more languages tend to alternate the language when they are speaking. This particular phenomenon... [more]	EA2021-73 SIP2021-100 SP2021-58 pp.56-63
EA, SIP, SP, IPSJ-SLP [detail]	2022-03-02 15:35	Okinawa	(Primary: On-site, Secondary: Online)	[Poster Presentation] A study of shout detection for clipped speech Taito Ishida, Kazuhiro Matsuda, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) EA2021-97 SIP2021-124 SP2021-82	Recently, several audio surveillance systems using shouted speech have been proposed for safety in daily life. Although... [more]	EA2021-97 SIP2021-124 SP2021-82 pp.207-212
EA, US (Joint)	2021-12-22 13:30	Kumamoto	Sojo University	[Poster Presentation] Improved voice quality due to multi-speaker learning with WaveNet vocoder Satoshi Yoshida, Shingo Uenohara, Ken'ichi Furuya (Oita Univ.) EA2021-57	In recent years, speech synthesis and voice quality conversion techniques using neural networks have attracted much atte... [more]	EA2021-57 pp.1-6
SP, WIT, IPSJ-SLP, ASJ-H [detail]	2021-10-19 15:10	Online	Online	A study on model training for DNN-HSMM-based speech synthesis using a large-scale speech corpus Nobuyuki Nishizawa, Gen Hattori (KDDI Research) SP2021-34 WIT2021-27	In this study, an investigation into model training for DNN-HSMM-based speech synthesis using a large speech corpus coll... [more]	SP2021-34 WIT2021-27 pp.52-57
SIS	2021-03-04 09:00	Online	Online	Optimization source-filtere based speech waveform generation using adversarial training Hayato Mitsui, Yosuke Sugiura, Nozomiko Yasui, Tetsuya Shimamura (Saitama Univ.) SIS2020-35	This research aims to improve the accuracy of the source-filter based speech waveform generation model using deep learni... [more]	SIS2020-35 pp.1-4
PRMU, IPSJ-CVIM	2021-03-05 09:45	Online	Online	Improved Speech Separation Performance from Monaural Mixed Speech Based on Deep Embedding Network Shaoxiang Dang, Tetsuya Matsumoto, Hiroaki Kudo (Nagoya Univ.), Yoshinori Takeuchi (Daido Univ.) PRMU2020-85	Speech separation refers to the separation of utterances in which multiple people are speaking simultaneously. The idea ... [more]	PRMU2020-85 pp.91-96
EA, US, SP, SIP, IPSJ-SLP [detail]	2021-03-03 14:05	Online	Online	[Poster Presentation] A unified source-filter network for neural vocoder Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda (Nagoya Univ.) EA2020-69 SIP2020-100 SP2020-34	In this paper, we propose a method to develop a neural vocoder using a single network based on the source-filter theory.... [more]	EA2020-69 SIP2020-100 SP2020-34 pp.57-62
MVE, IPSJ-CVIM	2021-01-21 15:50	Online	Online	Customer service training VR system with spoken voice traning Toki Nishio, Soichiro Iida (Univ. of Tsukuba), Yuta Sano, Leow Chee Siang, Hiromitsu Nishizaki (Univ. of Yamanashi), Takehito Utsuro, Junichi Hoshino (Univ. of Tsukuba) MVE2020-32	A filler is a word that has no meaning in itself and is used to fill gaps in conversation. In customer service, this fil... [more]	MVE2020-32 pp.13-16

Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)

[Return to Top Page]

[Return to IEICE Web Page]

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan