ken-system: - All Technical Committee Conferences

IEICE Technical Committee Submission System
Conference Schedule

Online Proceedings
[Sign in]
Tech. Rep. Archives

[Japanese] / [English]

(

Committee/Place/Topics

) --Press->

(

Paper Keywords: / Column:Title Auth. Affi. Abst. Keyword

) --Press->

All Technical Committee Conferences (Searched in: All Years)

Search Results: Conference Papers

Conference Papers (Available on Advance Programs) (Sort by: Date Descending)

Committee	Date Time	Place		Paper Title / Authors	Abstract	Paper #
EA	2024-05-22 14:15	Online	Online	未定 -- 未定 -- Tsubasa Ochiai (NTT), Kazuma Iwamoto (Doshisha Univ.), Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki (NTT), Shigeru Katagiri (Doshisha Univ.)	(To be available after the conference date) [more]
SIP, SP, EA, IPSJ-SLP [detail]	2024-02-29 16:45	Okinawa	(Primary: On-site, Secondary: Online)	Multiple Lag Window Pairs for Estimation of Fundamental Frequency and Periodicity Measure Michiki Koshimori (UEC), Shigeki Sagayama (UTokyo/UEC), Toru Nakashika (UEC) EA2023-75 SIP2023-122 SP2023-57	Extending the main concept of modified autocorrelation method in LPC, we investigate lag windows, lag window pairs, and ... [more]	EA2023-75 SIP2023-122 SP2023-57 pp.85-90
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	An experimental survey on speaker embedding spaces for controlling speaker identity in speech synthesis system Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) EA2023-93 SIP2023-140 SP2023-75	This study investigated the influence of the discriminability of speaker encoders on speech synthesis models that can co... [more]	EA2023-93 SIP2023-140 SP2023-75 pp.190-195
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	SELECTING N-LOWEST SCORES FOR TRAINING MOS PREDICTION MODELS Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko (NTT) EA2023-94 SIP2023-141 SP2023-76	Automatic speech quality assessment (SQA) is a task to evaluate the quality of speech samples without resorting to time-... [more]	EA2023-94 SIP2023-141 SP2023-76 pp.196-201
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	Improving training recipe of Remixed2Remixed for speech enhancement Li Li, Shogo Seki (CyberAgent) EA2023-95 SIP2023-142 SP2023-77	In the use of deep learning for speech enhancement, supervised learning models that use pairs of clean speech and artifi... [more]	EA2023-95 SIP2023-142 SP2023-77 pp.202-207
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	Multi-Dialect Speech Synthesis with Interpretable Accent latent Variable based on VQ-VAE Kazuki Yamauchi, Yuki Saito, Hiroshi Saruwatari (UTokyo) EA2023-98 SIP2023-145 SP2023-80	In this paper, we address two tasks: "Intra-dialect Text-to-Speech (TTS)," aiming to synthesize speech in the same diale... [more]	EA2023-98 SIP2023-145 SP2023-80 pp.220-225
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	Domain adaptation of speech recognition model based on multilingual SSL model with only nonparallel corpus. Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yukoh Wakabayashi (TUT), Kengo Ohta (NITA), Norihide Kitaoka (TUT) EA2023-100 SIP2023-147 SP2023-82	Automatic speech recognition (ASR) models are used in various services and businesses, and each domain’s recognition acc... [more]	EA2023-100 SIP2023-147 SP2023-82 pp.232-237
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 09:30	Okinawa	(Primary: On-site, Secondary: Online)	Evaluation of Automatic Speech Recognition for Deaf and Hard-of-Hearing People by Speaker Adaptation. Kaito Takahashi, Takahiro Kinouchi, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Akio Kobayashi (Yamato Univ.), Norihide Kitaoka (TUT) EA2023-102 SIP2023-149 SP2023-84	Communication between normal-hearing people and the deaf is generally used sign language, written communication, and spe... [more]	EA2023-102 SIP2023-149 SP2023-84 pp.244-249
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 10:40	Okinawa	(Primary: On-site, Secondary: Online)	Intermediate speaker speech synthesis between two speakers using x-vector speaker space Sota Hosoi, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka (TUT) EA2023-103 SIP2023-150 SP2023-85	Recent advancements in speech synthesis technologies have enabled the synthesis of speeches of speakers not in the train... [more]	EA2023-103 SIP2023-150 SP2023-85 pp.250-255
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 10:40	Okinawa	(Primary: On-site, Secondary: Online)	Speech representation based on VAE assuming gamma distribution for latent variables and observation Nanako Imaichi, Toru Nakashika (UEC) EA2023-104 SIP2023-151 SP2023-86	Recently, deep generative models that can represent complex relationships in data generation have been attracting attent... [more]	EA2023-104 SIP2023-151 SP2023-86 pp.256-261
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 10:40	Okinawa	(Primary: On-site, Secondary: Online)	Substitution of Implicit Linguistic Information in Beam Search Decoding Using CTC-based Speech Recognition Models Tatsunari Takagi, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-106 SIP2023-153 SP2023-88	The rise of neural networks in the field of automatic speech recognition has notably improved the accuracy of speech rec... [more]	EA2023-106 SIP2023-153 SP2023-88 pp.268-273
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 15:25	Okinawa	(Primary: On-site, Secondary: Online)	Investigation of objective intelligibility metrics based on speech foundation models for Clarity Prediction Challenge 2 Katsuhiko Yamamoto (CyberAgent) EA2023-119 SIP2023-166 SP2023-101	Speech Foundation Models (SFMs), which use components like the encoder layer of Whisper, have been suggested to separate... [more]	EA2023-119 SIP2023-166 SP2023-101 pp.334-339
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 16:35	Okinawa	(Primary: On-site, Secondary: Online)	Discrimination of rotation direction of virtual sound source in binaural synthesis using sound source radiation characteristics Orie Nishiyama (Chiba Institute of Technology), Toshiharu Horiuchi, Shota Okubo (KDDI Research, Inc.), Yoshifumi Chisaki (Chiba Institute of Technology) EA2023-125 SIP2023-172 SP2023-107	In order to provide the sensation of being there, research has been conducted on realistic communication that acquires, ... [more]	EA2023-125 SIP2023-172 SP2023-107 pp.376-381
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 16:35	Okinawa	(Primary: On-site, Secondary: Online)	Simulation Evaluation of Speech Detection Based on Distributed Sound-to-Light Conversion Device Blinkies Satoshi Motoyama, Natsuki Ueno, Masahiro Yasuda (TMU), Yuma Kinoshita (Tokai Univ.), Nobutaka Ono (TMU) EA2023-126 SIP2023-173 SP2023-108	The purpose of this study is speech detection using the distributed sound-to-light conversion device Blinkies. As an ini... [more]	EA2023-126 SIP2023-173 SP2023-108 pp.382-387
SIP, SP, EA, IPSJ-SLP [detail]	2024-03-01 16:05	Okinawa	(Primary: On-site, Secondary: Online)	Evaluating speech generation based on objective measures for text generation Takaaki Saeki (UTokyo), Soumi Maiti (CMU), Shinnosuke Takamichi (UTokyo), Shinji Watanabe (CMU), Hiroshi Saruwatari (UTokyo) EA2023-133 SIP2023-180 SP2023-115	In the evaluation of speech generation, while subjective judgments have long been the gold standard, objective metrics s... [more]	EA2023-133 SIP2023-180 SP2023-115 pp.421-426
HCGSYMPO (2nd)	2023-12-11 - 2023-12-13	Fukuoka	Asia pacific Import Mart (Kitakyushu) (Primary: On-site, Secondary: Online)	Effect Evaluation of Pain Perception by Robot's Stroking with Speech Kota Nieda, Taishi Sawabe, Masayuki Kanbara, Yuichiro Fujimoto, Hirokazu Kato (NAIST)	The purpose of this study is to verify whether the " stroking with speech" behavior by the robot can change the percepti... [more]
SP, NLC, IPSJ-SLP, IPSJ-NL [detail]	2023-12-03 09:30	Tokyo	Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online)	Enhancing Recognition of Rare Words in ASR through Error Detection and Context-Aware Error Correction Jiajun He, Zekun Yang, Tomoki Toda (Nagoya Univ.) NLC2023-16 SP2023-36	Automatic speech recognition (ASR) systems often suffer from errors, particularly when recognizing rare words. These err... [more]	NLC2023-16 SP2023-36 pp.13-18
SP, NLC, IPSJ-SLP, IPSJ-NL [detail]	2023-12-03 10:00	Tokyo	Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online)	Improvement of Tacotron2 text-to-speech model based on masking operation and positional attention mechanism Tong Ma, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) NLC2023-17 SP2023-37	[more]	NLC2023-17 SP2023-37 pp.19-24
SP, NLC, IPSJ-SLP, IPSJ-NL [detail]	2023-12-03 11:05	Tokyo	Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online)	[Poster Presentation] Enhancing Multi-Accent Automated Speech Recognition with Accent-Activated Adapters Yuqin Lin, Longbiao Wang, Jianwu Dang (Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu (Univ. of Tokyo) NLC2023-18 SP2023-38	This paper proposes the Accent-Activated adapter (AccentAct) approach to address the challenge of speech variations in m... [more]	NLC2023-18 SP2023-38 pp.25-30
SP, NLC, IPSJ-SLP, IPSJ-NL [detail]	2023-12-03 11:05	Tokyo	Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online)	[Poster Presentation] Enhancing Dysarthric Speech Recognition with Auxiliary Feature Fusion Module: Exploring Articulatory-related Features from Foundation Models Yuqin Lin, Longbiao Wang, Jianwu Dang (Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu (Univ. of Tokyo) NLC2023-19 SP2023-39	Addressing dysarthric speech variability in Automatic Speech Recognition (ASR) is crucial for improving human-computer i... [more]	NLC2023-19 SP2023-39 pp.31-36

Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)

[Return to Top Page]

[Return to IEICE Web Page]

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan