Information and Systems-Speech(Date:2023/02/28)

Presentation
Regularization Term Design Based on Spectrogram Consistency in Independent Low-Rank Matrix Analysis for Multichannel Audio Source Separation

Sota Misawa(UTokyo),  Norihiro Takamune(UTokyo),  Kohei Yatabe(TUAT),  Daichi Kitamura(NIT, Kagawa),  Hiroshi Saruwatari(UTokyo),  

[Date]2023-03-01
[Paper #]EA2022-105,SIP2022-149,SP2022-69
The linguistic influence on speaker verification based on Self-Supervised Learning

Tomoka Wakamatsu(Tokyo Metropolitan Univ.),  Atsushi Ando(NTT),  Sayaka Shiota(Tokyo Metropolitan Univ.),  Ryo Masumura(NTT),  Hitoshi Kiya(Tokyo Metropolitan Univ.),  

[Date]2023-03-01
[Paper #]EA2022-118,SIP2022-162,SP2022-82
Anomalous sound detection with complex-valued hybrid neural networks considering phase variations

Shota Nishiyama(AIT),  Akira Tamamori(AIT),  

[Date]2023-03-01
[Paper #]EA2022-106,SIP2022-150,SP2022-70
Quasi-real-time estimation of a maximum radiation direction from a loudspeaker surrounded by four microphones based on SPL ratio

Ryusei Tsuda(Osaka Sangyo Univ.),  Daiki Maekawa(Osaka Sangyo Univ.),  Tomoru Awatani(Osaka Sangyo Univ.),  Masato Nakayama(Osaka Sangyo Univ.),  Toru Takahashi(Osaka Sangyo Univ.),  

[Date]2023-03-01
[Paper #]EA2022-111,SIP2022-155,SP2022-75
Analysis of Noisy-target Training for DNN-based speech enhancement and investigation towards its practical use

Takuya Fujimura(Nagoya Univ.),  Tomoki Toda(Nagoya Univ.),  

[Date]2023-03-01
[Paper #]EA2022-112,SIP2022-156,SP2022-76
An Investigation of Text-to-Speech Synthesis Using Voice Conversion and x-vector Embedding Sympathizing Emotion of Input Audio for Spoken Dialogue Systems

Shunichi Kohara(Okayama Univ.),  Masanobu Abe(Okayama Univ.),  Sunao Hara(Okayama Univ.),  

[Date]2023-03-01
[Paper #]EA2022-109,SIP2022-153,SP2022-73
Anomalous sound detection based on differential features of multi channel acoustic signals considering spatial and temporal variations

Shota Nishiyama(AIT),  Akira Tamamori(AIT),  

[Date]2023-03-01
[Paper #]
A Study on Selective Fixed-Filter ANC Using 2D-CNN with Sliding DCT input

Kenya Doi(KU),  Yoshinobu Kajikawa(KU),  

[Date]2023-03-01
[Paper #]EA2022-113,SIP2022-157,SP2022-77
RGB-D Salient Object Detection Using Saliency and Edge Reverse Attention

Tomoki Ikeda(Keio Univ.),  Masaaki Ikehara(Keio Univ.),  

[Date]2023-03-01
[Paper #]EA2022-127,SIP2022-171,SP2022-91
Increasing speech intelligibility for evacuation guidance by mimicking professional announcers’ voice

KimDung Tran(JAIST),  Masato Akagi(JAIST),  Masashi Unoki(JAIST),  

[Date]2023-03-01
[Paper #]EA2022-119,SIP2022-163,SP2022-83
Vocabulary-Set Decomposition and Multi-task Learning for Target Vocabulary Extraction in Japanese Speech Recognition

Aoi Ito(LINE/Hosei Univ.),  Tatsuya Komatsu(LINE),  Yusuke Fujita(LINE),  

[Date]2023-03-01
[Paper #]EA2022-102,SIP2022-146,SP2022-66
Multiscale Manifold Clustering and Embedding with Multiple Kernels

Kyohei Suzuki(Keio Univ.),  Masahiro Yukawa(Keio Univ.),  

[Date]2023-03-01
[Paper #]EA2022-123,SIP2022-167,SP2022-87
Predominant Instrument Recognition in Polyphonic Music Based on Transfer Learning with Vanilla ResNet-50

Lifan Zhong(UTokyo),  Nobuaki Minematsu(UTokyo),  Daisuke Saito(UTokyo),  

[Date]2023-03-01
[Paper #]EA2022-114,SIP2022-158,SP2022-78
Corpus construction toward multi-domain empathetic dialogue speech synthesis

Yuki Saito(UT),  Eiji Iimori(UT),  Shinnosuke Takamichi(UT),  Kentaro Tachibana(LINE),  Hiroshi Saruwatari(UT),  

[Date]2023-03-01
[Paper #]
On Design of Real Filters For Directed Graph Signals

Shogo Muramatsu(Niigta Univ.),  Hotaka Kitamura(Niigta Univ.),  Hiroyashu Yasuda(Niigta Univ.),  Yuichi Tanaka(Osaka Univ.),  

[Date]2023-03-01
[Paper #]EA2022-124,SIP2022-168,SP2022-88
Personality Recognition on Dyadic Interactions with Representation Learning

Nathania Nah(Tokyo Tech),  Takafumi Koshinaka(YCU),  Koichi Shinoda(Tokyo Tech),  

[Date]2023-03-01
[Paper #]EA2022-117,SIP2022-161,SP2022-81
[Invited Talk] Speech and Language Research in the Google Tokyo Office

Michiel Bacchiani(Google),  

[Date]2023-03-01
[Paper #]EA2022-116,SIP2022-160,SP2022-80
Choral Singing Voice Synthesis with Fundamental Frequency Modulation

Sora Miyazawa(UTokyo),  Anan Kikuchi(UTokyo),  Daisuke Saito(UTokyo),  Nobuaki Minematsu(UTokyo),  

[Date]2023-03-01
[Paper #]EA2022-110,SIP2022-154,SP2022-74
Low-bit Image Restoration with Loop-unrolled ISTA

Shu Abe(Niigata Univ),  Soushi Takahashi(Niigata Univ),  Shogo Muramatsu(Niigata Univ),  

[Date]2023-03-01
[Paper #]EA2022-125,SIP2022-169,SP2022-89
A Study on Virtual Sensing Method for Hybrid Active Noise Control System

Shota Toyooka(Kansai Univ.),  Kajikawa Yoshinobu(Kansai Univ.),  

[Date]2023-03-01
[Paper #]EA2022-126,SIP2022-170,SP2022-90
<<1234>> 41-60hit(77hit)