Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
CAS, CS |
2024-03-14 13:30 |
Okinawa |
|
Characterization of Semantic Communications in Speech Signal Transmission Futo Iwanaga, Daisuke Umehara (Kyoto Inst. of Tech.) CAS2023-118 CS2023-111 |
In recent years, the volume of data in data communication has surged, Characterization of Semantic Communications in Spe... [more] |
CAS2023-118 CS2023-111 pp.41-46 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Speech representation based on VAE assuming gamma distribution for latent variables and observation Nanako Imaichi, Toru Nakashika (UEC) EA2023-104 SIP2023-151 SP2023-86 |
Recently, deep generative models that can represent complex relationships in data generation have been attracting attent... [more] |
EA2023-104 SIP2023-151 SP2023-86 pp.256-261 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Substitution of Implicit Linguistic Information in Beam Search Decoding Using CTC-based Speech Recognition Models Tatsunari Takagi, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-106 SIP2023-153 SP2023-88 |
The rise of neural networks in the field of automatic speech recognition has notably improved the accuracy of speech rec... [more] |
EA2023-106 SIP2023-153 SP2023-88 pp.268-273 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 09:30 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online) |
Enhancing Recognition of Rare Words in ASR through Error Detection and Context-Aware Error Correction Jiajun He, Zekun Yang, Tomoki Toda (Nagoya Univ.) NLC2023-16 SP2023-36 |
Automatic speech recognition (ASR) systems often suffer from errors, particularly when recognizing rare words. These err... [more] |
NLC2023-16 SP2023-36 pp.13-18 |
WIT, SP, IPSJ-SLP [detail] |
2023-10-14 17:05 |
Fukuoka |
Kyushu Institute of Technology (Primary: On-site, Secondary: Online) |
Electrolaryngeal Speech Enhancement through Strong Linguistic Encoding Methods Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) SP2023-33 WIT2023-24 |
Although pretraining and fine-tuning approaches have proven to work well in speech intelligibility enhancement, various ... [more] |
SP2023-33 WIT2023-24 pp.33-38 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
A Study on Scheduled Sampling for Neural Transducer-based ASR Takafumi Moriya, Takanori Ashihara, Hiroshi Sato, Kohei Matsuura, Tomohiro Tanaka, Ryo Masumura (NTT) EA2022-100 SIP2022-144 SP2022-64 |
In this paper, we propose scheduled sampling approaches suited for the recurrent neural network-transducer (RNNT) that i... [more] |
EA2022-100 SIP2022-144 SP2022-64 pp.147-152 |
HCS |
2023-01-22 16:00 |
Kyoto |
Kyoto Institute of Technology (Primary: On-site, Secondary: Online) |
Decoding of average ERPs during silent Japanese words by attention-based RNN with encoder-decoder Toshimasa Yamazaki, Yuko Tokunaga, Chieko Ito (KIT) HCS2022-74 |
This study attempted to decode average event-related potentials (ERPs) during silent Japanese words by attention-based r... [more] |
HCS2022-74 pp.108-111 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 10:50 |
Online |
Online |
[Invited Talk]
Crazy vocoder is unbreakable -- But let's talk about an informal vision of the future -- Masanori Morise (Meiji Univ.) SP2022-15 |
When current speech synthesis researchers refer to Vocoder in their papers, they are most likely referring to Neural voc... [more] |
SP2022-15 pp.61-66 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-01 12:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Incorporating Acoustic and Textual Information for Language Modeling in Code-switching Speech Recognition Roland Hartanto, Kuniaki Uto, Koichi Shinoda (TokyoTech) EA2021-73 SIP2021-100 SP2021-58 |
People who speak two or more languages tend to alternate the language when they are speaking. This particular phenomenon... [more] |
EA2021-73 SIP2021-100 SP2021-58 pp.56-63 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-18 13:00 |
Online |
Online |
F0 estimation of speech based on l2-norm regularized TV-CAR analysis Keiichi Funaki (Univ. of the Ryukyus) SP2021-2 |
Linear Prediction (LP) is the most successful speech analysis in speech processing, including speech coding implemented... [more] |
SP2021-2 pp.7-12 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online |
[Poster Presentation]
A unified source-filter network for neural vocoder Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda (Nagoya Univ.) EA2020-69 SIP2020-100 SP2020-34 |
In this paper, we propose a method to develop a neural vocoder using a single network based on the source-filter theory.... [more] |
EA2020-69 SIP2020-100 SP2020-34 pp.57-62 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
A Comparison of Language Models for a Design of Reduced Phoneme Set Shuji Komeiji, Toshihisa Tanaka (TUAT), Koichi Shinoda (titech) EA2019-152 SIP2019-154 SP2019-101 |
Language models for the design of a reduced phoneme set are compared with each other. The reduction of the phoneme set improves ... [more] |
EA2019-152 SIP2019-154 SP2019-101 pp.295-300 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2019-12-06 13:55 |
Tokyo |
NHK Science & Technology Research Labs. |
[Poster Presentation]
Time-Varying Complex AR speech analysis based on l2-norm regularization Keiichi Funaki (Univ. of the Ryukyus) SP2019-41 |
Linear prediction (LP) is a mathematical operation estimating an all-pole spectrum from the speech signal. It is an ess... [more] |
SP2019-41 pp.73-77 |
ISEC, SITE, ICSS, EMM, HWS, BioX, IPSJ-CSEC, IPSJ-SPT [detail] |
2019-07-24 12:10 |
Kochi |
Kochi University of Technology |
Recording device identification based on audio distortion depending on system-on-chip Akira Nishimura (Tokyo Univ. Info. Sci.) ISEC2019-48 SITE2019-42 BioX2019-40 HWS2019-43 ICSS2019-46 EMM2019-51 |
This study addresses device-specific distortion observed in recorded audio, to identify a built-in system-on-a-chip (... [more] |
ISEC2019-48 SITE2019-42 BioX2019-40 HWS2019-43 ICSS2019-46 EMM2019-51 pp.311-316 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
F0 estimation using TV-CAR speech analysis based on Regularized LP Keiichi Funaki (Univ. of the Ryukyus) EA2018-152 SIP2018-158 SP2018-114 |
Linear Prediction (LP) analysis is a speech analysis method that estimates AR (Auto-Regressive) coefficients to represent the all-pol... [more] |
EA2018-152 SIP2018-158 SP2018-114 pp.311-316 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2018-12-10 13:15 |
Tokyo |
Waseda Univ. Nishiwaseda Campus |
[Invited Talk]
Review of Automatic Speech Recognition Methodology -- Outlook of Acoustic-to-Word Model -- Tatsuya Kawahara (Kyoto Univ.) SP2018-48 |
The methodology of speech recognition has been changing due to the introduction of deep learning, in particular end-to-e... [more] |
SP2018-48 pp.25-30 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
Perceptual influence of spectral envelope and aperiodicity quantization for encoding high-quality speech Genta Miyashita, Masanori Morise (Univ. of Yamanashi) EA2017-145 SIP2017-154 SP2017-128 |
In this paper, we investigate the relationship between the degradation of sound quality and the parameter quantization i... [more] |
EA2017-145 SIP2017-154 SP2017-128 pp.241-244 |
SITE, EMM, ISEC, ICSS, IPSJ-CSEC, IPSJ-SPT [detail] |
2017-07-15 13:25 |
Tokyo |
|
Investigation of spikegram-based signal representation for speech fingerprints Dung Kim Tran, Masashi Unoki (JAIST) ISEC2017-32 SITE2017-24 ICSS2017-31 EMM2017-35 |
This paper investigates the ability of spikegrams in representing the speech content and voice identifications of speech s... [more] |
ISEC2017-32 SITE2017-24 ICSS2017-31 EMM2017-35 pp.241-246 |
SP, SIP, EA |
2017-03-02 10:45 |
Okinawa |
Okinawa Industry Support Center |
[Special Invited Talk]
Speech and Audio Coding for High-Quality Services of Mobile-Phone and Broadcasting Takehiro Moriya (NTT) EA2016-138 SIP2016-193 SP2016-133 |
Among recent research and development trends on speech and audio coding, two international standard schemes are introduc... [more] |
EA2016-138 SIP2016-193 SP2016-133 p.313 |
EA, ASJ-H, IPSJ-MUS [detail] |
2016-10-15 09:00 |
Ishikawa |
Noto Omakidai (Nanao) |
[Fellow Memorial Lecture]
Progress in speech and audio coding technologies Takehiro Moriya (NTT) EA2016-47 |
Among recent research and development trends on speech and audio coding, two international standard schemes are introduc... [more] |
EA2016-47 p.109 |