Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SIS, IPSJ-AVM |
2022-06-10 11:40 |
Fukuoka |
KIT(Wakamatsu Campus) (Primary: On-site, Secondary: Online) |
An embedded-oriented sound classification system using reservoir computing Yuichiro Tanaka, Issei Uchino (Kyutech), Kazunobu Ohkuri (Sony), Hakaru Tamukoh (Kyutech) SIS2022-9 |
Although deep neural networks (DNNs) have achieved state-of-the-art results in sound classification tasks in recent year... [more] |
SIS2022-9 pp.41-44 |
EA, SIP, SP |
2019-03-15 10:00 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
Consideration on Effectiveness of Relative Phase from Residual Speech for Speaker Recognition Seiichi Nakagawa, Kazumasa Yamamoto, Kazumasa Yamamoto (Chubu Univ.) EA2018-130 SIP2018-136 SP2018-92 |
We have focused on phase spectrum for speaker recognition. So we proposed relative phase as a feature parameter for spea... [more] |
EA2018-130 SIP2018-136 SP2018-92 pp.185-190 |
SP |
2019-01-27 11:05 |
Ishikawa |
Kanazawa-Harmonie |
A Speaker Recognition Performance Measure based on the Adaptation Quickness and Final Accuracy for Spoken Dialog Systems Junko Takami, Takeshi Kawabata (KGU) SP2018-59 |
For constructing user friendly spoken dialog system, it is important to recognize "Who is the user?" and to choose appro... [more] |
SP2018-59 pp.35-40 |
SP, IPSJ-SLP (Joint) |
2017-07-28 11:15 |
Miyagi |
Akiu Resort Hotel Crescent |
Speaker Diarization for Face-to-Face Dialog of Service Counters Based on Appearance Pattern of Speakers Mizuki Watabe (NTT DOCOMO), Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono (NTT), Takanobu Oba, Yoshinori Isoda (NTT DOCOMO) SP2017-19 |
This paper proposes a speaker diarization method for face-to-face dialogue of service counters using appearance pattern ... [more] |
SP2017-19 pp.21-26 |
SP, SIP, EA |
2017-03-02 09:00 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Hardware Speech Sensor Based on Deep Neural Network Feature Extractor and Template Matching Yi Liu, Boyu Qian, Jian Wang, Takahiro Shinozaki (Titech) EA2016-135 SIP2016-190 SP2016-130 |
We explore the possibility of combination of a DNN-based feature extractor and template based matching for keyword detec... [more] |
EA2016-135 SIP2016-190 SP2016-130 pp.297-300 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 11:20 |
Tokyo |
NTT Musashino R&D |
Speaker Recognition Based on Features through 1-Dimensional Convolutional Neural Network Shohei Sonoda, Yufu Kasahara, Masato Inoue (Waseda Univ) SP2016-52 |
Most of the speaker recognition methods utilize the voice features of the mel-frequency cepstrum coefficients (MFCCs) an... [more] |
SP2016-52 pp.17-21 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 11:45 |
Tokyo |
NTT Musashino R&D |
Study on i-vector based speaker verification using rank for short utterances Misaki Tsujikawa (Panasonic/Sokendai), Tsuyoki Nishikawa (Panasonic), Tomoko Matsui (ISM) SP2016-53 |
Generally, short utterance test data seriously degrades the accuracy of speaker verification. However, in many voice-ope... [more] |
SP2016-53 pp.23-26 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition Yoshihiro Suzuki, Yosuke Sugiura, Tetsuya Shimamura (Saitama Univ.) SP2016-58 |
In this paper, we propose a neural network architecture for speaker recognition to simplify learning process. In the pro... [more] |
SP2016-58 pp.53-56 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 10:25 |
Aichi |
Nagoya Inst of Tech. |
Simultaneous Modelling of Acoustic, Phonetic, Speaker Features Using Improved Three-Way Restricted Boltzmann Machine Toru Nakashika (UEC), Tetsuya Takiguchi (Kobe Univ.) SP2015-71 |
In this paper, we argue the way of modelling speech signals using improved three-way restricted Boltzmann machine (3WRBM... [more] |
SP2015-71 pp.7-12 |
WIT, SP, ASJ-H, PRMU |
2015-06-18 14:50 |
Niigata |
|
Study on i-vector based speaker identification for short utterances Misaki Tsujikawa (Panasonic Corporation/SOKENDAI), Tsuyoki Nishikawa (Panasonic Corporation), Tomoko Matsui (ISM) PRMU2015-43 SP2015-12 WIT2015-12 |
Recently, voice controlled system is growing popular due to the development of speech recognition technology. For exampl... [more] |
PRMU2015-43 SP2015-12 WIT2015-12 pp.65-70 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
A Study on Speaker Recognition Method in Consideration of Speaking Style Differences in Lecture Speech Kota Nakatsuji (Doshisha Univ.), Masafumi Nishida (Nagoya Univ.), Seiichi Yamamoto (Doshisha Univ.) SP2014-123 |
Speaker recognition technology has been applied to achieve a variety of tasks such as minute taking and speaker search f... [more] |
SP2014-123 pp.141-146 |
BioX, ITE-ME, ITE-IST [detail] |
2014-06-16 15:45 |
Ishikawa |
Kanazawa University, Kakuma Campus |
Speaker verification based on enrolled individual speeches Hiroaki Natsumi, Manabu Kawasaki, Kouki Tanaka (SOHGO SECURITY SERVICES) BioX2014-6 |
As a part of applied researches of biometric authentication technique, we investigate speaker recognition, which identif... [more] |
BioX2014-6 pp.31-35 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Robustness of Speaker Identification Using Pseudo Pitch Synchronized Phase Information Yuta Kawakami, Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.), Seiichi Nakagawa (Toyohashi Univ. of Tech.) SP2014-11 |
The phase information is useful for the speaker recognition task, but MFCC ignores that. In this work, we conducted spea... [more] |
SP2014-11 pp.123-126 |
SP |
2014-01-23 16:00 |
Aichi |
Meijo Univ. |
Speaker recognition based on log-linear models using feature generation by variational Bayesian method Akifumi Tsuge, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2013-98 |
This paper presents a speaker recognition technique based on log-linear models (LLMs) using Bayesian statistics. Since d... [more] |
SP2013-98 pp.13-18 |
SP |
2013-03-01 16:00 |
Aichi |
Daido University |
Current situations and issues of speaker recognition technologies Kanae Amino (NRIPS), Shunichi Ishihara (The Australian National University), Tetsuji Ogawa (Waseda Univ.), Takashi Osanai (NRIPS), Shingo Kuroiwa (Chiba Univ.), Takafumi Koshinaka (NEC), Koichi Shinoda (Tokyo Inst. of Tech.), Satoru Tsuge (Daido Univ.), Masafumi Nishida (Doshisha Univ.), Tomoko Matsui (ISM), Longbiao Wang (Nagaoka University of Technology) SP2012-131 |
Speaker recognition for recognizing who is speaking from his/her voice has been studied for 30 years. As the importance ... [more] |
SP2012-131 pp.63-70 |
SP |
2013-01-30 15:45 |
Kyoto |
Doshisha Univ. |
A Study on Speaker Recognition Based on Decomposition of Periodic and Aperiodic Components Yuki Ishikawa, Masafumi Nishida (Doshisha Univ.), Masakiyo Fujimoto (NTT), Seiichi Yamamoto (Doshisha Univ.) SP2012-102 |
In conventional researches, mel-frequency cepstral coefficients (MFCC) are widely used for a feature parameter which app... [more] |
SP2012-102 pp.25-30 |
WIT, SP |
2011-10-06 13:30 |
Tokyo |
TFT Bldg. |
A Study of Speaker Recognition using Phase Spectrum Atsushi Ueda (Kanagawa Prefectural Police), Kiyoshi Mizui (Kanto Gakuin Univ.) SP2011-55 WIT2011-37 |
In this study, we propose speaker recognition system using phase spectrum for noisy voice with lack of high-frequency ba... [more] |
SP2011-55 WIT2011-37 pp.19-24 |
SP |
2011-07-21 15:00 |
Hokkaido |
Jozankei Grand Hotel |
Construction of Speaker Model Using A New GMM Learning Method Based on Clustering Masaki Mifune, Motoyuki Suzuki, Fuji Ren, Kenji Kita (Univ. of Tokushima) SP2011-42 |
In the speaker identification research fields,
Gaussian Mixture Models (GMM) are widely used as speaker models because ... [more] |
SP2011-42 pp.7-10 |
NC, MBE (Joint) |
2011-03-09 09:50 |
Tokyo |
Tamagawa University |
Construction of voiceprint identification systems using multi-step neural networks Shunsuke Onishi, Hiroshi Hasegawa, Kentaro Kinoshita, Satoru Kishida (Tottori Univ.) NC2010-186 |
We constructed a voiceprint identification system using three-layered neural networks with a back-propagation learning a... [more] |
NC2010-186 pp.349-353 |
EA |
2008-10-23 16:00 |
Toyama |
|
Vowel Speaker Recognition Using Projections to Speaker Subspaces Constructed with Excitation Pattern of Auditory Peripherals Futoshi Endo, Mamoru Iwaki (Niigata Univ.) EA2008-73 |
Excitation pattern (EP) of auditory peripheral model can estimate spectral information conducted into auditory periphera... [more] |
EA2008-73 pp.49-54 |