Information and Systems-Speech(Date:2023/12/02)

Presentation
Effectiveness of Signal Compression in Speech Enhancement with Diffusion Models

Yuki Nishi(Titech), Koji Iwano(Tokyo City Univ.), Koichi Shinoda(Titech)

[Date]2023-12-02
[Paper #]NLC2023-14,SP2023-34
Development and effects of English speech training drills to improve perception and production skills seamlessly with interactive gamification

Nobuaki Minematsu(UTokyo), Yingxiang Gao(UTokyo), Noriko Nakanishi(KGU), Yusuke Inoue(Carriage), Hiroaki Mizuno(Carriage)

[Date]2023-12-02
[Paper #]NLC2023-15,SP2023-35
Enhancing Recognition of Rare Words in ASR through Error Detection and Context-Aware Error Correction

Jiajun He(Nagoya Univ.), Zekun Yang(Nagoya Univ.), Tomoki Toda(Nagoya Univ.)

[Date]2023-12-03
[Paper #]NLC2023-16,SP2023-36
[Poster Presentation] Enhancing Multi-Accent Automated Speech Recognition with Accent-Activated Adapters

Yuqin Lin(Tianjin Univ. & Univ. of Tokyo), Longbiao Wang(Tianjin Univ. & Univ. of Tokyo), Jianwu Dang(Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu(Univ. of Tokyo)

[Date]2023-12-03
[Paper #]NLC2023-18,SP2023-38
[Poster Presentation] Enhancing Dysarthric Speech Recognition with Auxiliary Feature Fusion Module: Exploring Articulatory-related Features from Foundation Models

Yuqin Lin(Tianjin Univ. & Univ. of Tokyo), Longbiao Wang(Tianjin Univ. & Univ. of Tokyo), Jianwu Dang(Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu(Univ. of Tokyo)

[Date]2023-12-03
[Paper #]NLC2023-19,SP2023-39
[Poster Presentation] Integration of Throat Microphone Recording and Bandwidth Extension for Robust Assessment of L2 Listening

Yu Xu(Univ. of Tokyo), Nobuaki Minematsu(Univ. of Tokyo), Daisuke Saito(Univ. of Tokyo)

[Date]2023-12-03
[Paper #]NLC2023-20,SP2023-40
[Poster Presentation] Self-supervised learning model based emotion transfer and intensity control technology for expressive speech synthesis

Wei Li(Univ. of Tokyo), Nobuaki Minematsu(Univ. of Tokyo), Daisuke Saito(Univ. of Tokyo)

[Date]2023-12-03
[Paper #]NLC2023-21,SP2023-41
Improvement of Tacotron2 text-to-speech model based on masking operation and positional attention mechanism

Tong Ma(Univ. of Tokyo), Daisuke Saito(Univ. of Tokyo), Nobuaki Minematsu(Univ. of Tokyo)

[Date]2023-12-03
[Paper #]NLC2023-17,SP2023-37
Report on Participation in Interspeech 2023

Kentaro Mitsui(rinna), Kohei Matsuura(NTT)

[Date]2023-12-04
[Paper #]NLC2023-22,SP2023-42