Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 13:55 |
Okinawa |
|
DNN prefiltering for enhancement of voice recognition in noise environment Jun Takahashi, Kentaro Murase (Fujitsu Labs.) EA2017-170 SIP2017-179 SP2017-153 |
In this paper, we applied convolutional denoising autoencoder (CDAE) as the prefilter of voice recognition and evaluated... [more] |
EA2017-170 SIP2017-179 SP2017-153 pp.373-378 |
MSS, NLP (Joint) |
2018-03-14 10:05 |
Osaka |
|
Improvement of Spaito-Temporal Situation Recognition Using Staff's Behavior Logs Shun Hayakashi, Kunihiko Hiraishi, Naoshi Uchihira (JAIST) MSS2017-90 |
Recently, assist technologies for work staff using ICT devices is expected to be introduced into various real fields. In... [more] |
MSS2017-90 pp.67-72 |
IN |
2018-01-23 14:00 |
Aichi |
WINC AICHI |
Pitch-based Cluster Segmentalization for Voice Actor Recognition Using Gender Determination in Anime Video Motoki Eida, Shun Hattori (Muroran Inst. of Tech.) IN2017-86 |
When we hear someone's voice from an anime video, we need to carry extra burdens of searching the end roll of the anime ... [more] |
IN2017-86 pp.85-90 |
LOIS, ICM |
2018-01-19 10:00 |
Kumamoto |
Sojo University |
A study on verification result determination method using speech verification score and neural network in speech input quiz system Kyouhei Fukuda, Hiroyuki Nishi, Yoshimasa Kimura (Sojo Univ) ICM2017-43 LOIS2017-59 |
In the quiz speech recognition, confirming the recognition result to the solver is likely to present an answer to the qu... [more] |
ICM2017-43 LOIS2017-59 pp.53-56 |
HCGSYMPO (2nd) |
2017-12-13 - 2017-12-15 |
Ishikawa |
THE KANAZAWA THEATRE |
Proposal of Real-time Reading Support System Koichi Koyasu, Anna Yokokubo, Guillaume Lopez (Aoyama Gakuin Univ.) |
Reading is established as a Japanese culture, and it is also an important educational activity in kindergartens, nursery... [more] |
|
AI |
2017-11-24 15:55 |
Fukuoka |
|
Context Aware Gender Determination for Voice Actor Recognition in Anime Video Motoki Eida, Shun Hattori (Muroran Inst. of Tech.) AI2017-16 |
When we hear someone's voice from an anime video, we need to carry extra burdens of searching the end roll of the anime ... [more] |
AI2017-16 pp.55-60 |
SP, IPSJ-SLP (Joint) |
2017-07-27 14:30 |
Miyagi |
Akiu Resort Hotel Crescent |
[Invited Talk]
Synthesis, Recognition and Conversion of Various Speech Using Deep Learning and Their Applications Takashi Nose (Tohoku Univ.) SP2017-16 |
This paper focuses on synthesis, recognition and conversion of various speech in the speech processing using deep learni... [more] |
SP2017-16 pp.3-8 |
SP, IPSJ-SLP (Joint) |
2017-07-27 16:15 |
Miyagi |
Akiu Resort Hotel Crescent |
Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities and Evaluation of Dual Learning Hiroyuki Miyoshi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (Univ. of Tokyo) SP2017-17 |
Voice conversion (VC) using sequence-to-sequence learning of context posterior probabilities is proposed. Conventional V... [more] |
SP2017-17 pp.9-14 |
SP |
2017-01-21 14:00 |
Tokyo |
The University of Tokyo |
[Invited Talk]
Deep learning in voice conversion Daisuke Saito (UTokyo) SP2016-72 |
In this paper, deep learning techniques in voice conversion studies are overviewed. Recently, deep learning techniques w... [more] |
SP2016-72 pp.47-52 |
ET |
2016-12-10 13:20 |
Osaka |
Kindai University |
Utterance Detection using Facial Image Combined with Voice Detection
-- Partial System for Reading Activity Understanding in Japanese Text Presentation System -- Shuichi Tashiro, Shu Aoki, Kyota Aoki, Koji Harada (Utsunomiya Univ.) ET2016-70 |
The authors implemented the system which detects utterance sections using mouth motion and reading aloud voice. This sys... [more] |
ET2016-70 pp.21-26 |
IN, MoNA, CNR (Joint) |
2016-11-18 12:45 |
Kagoshima |
Kirishima-kanko Hotel |
Voice Actor Recognition Using Frequency Spectrum in Anime Video Motoki Eida, Shun Hattori (Muroran Inst. of Tech.) IN2016-64 |
When we hear someone's voice from an anime video, we need to carry extra burdens of searching the end roll of the anime ... [more] |
IN2016-64 pp.25-30 |
EA, SP, SIP |
2016-03-28 13:15 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
Voice Recognition using Signal Clustering Neural Network with Wavelet Transform Feature Extraction Bandhit Suksiri, Masahiro Fukumoto (KUT) EA2015-94 SIP2015-143 SP2015-122 |
This paper presents a new voice recognition method named as Signal Clustering Neural Network by simple ANN model with a ... [more] |
EA2015-94 SIP2015-143 SP2015-122 pp.165-170 |
IN |
2016-01-21 14:00 |
Aichi |
Nagoya Kigyou Fukushi Kaikan |
Automatic Baseball Video Tagging Using Ball-by-Ball Textual Report and Voice Recognition Komei Arasawa, Shun Hattori (Muroran Inst. of Tech.) IN2015-95 |
To enable us to select the only scenes that we want to watch in a baseball video and personalize its highlights sub-vide... [more] |
IN2015-95 pp.1-6 |
IN |
2016-01-21 14:25 |
Aichi |
Nagoya Kigyou Fukushi Kaikan |
Voice Actor Recognition Using Voice and Cast Information of Anime Video Motoki Eida, Shun Hattori (Muroran Inst. of Tech.) IN2015-96 |
When we hear a voice from amusement media such as animes, games, movies, and music, we sometimes feel like that we have ... [more] |
IN2015-96 pp.7-12 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 10:25 |
Aichi |
Nagoya Inst of Tech. |
Simultaneous Modelling of Acoustic, Phonetic, Speaker Features Using Improved Three-Way Restricted Boltzmann Machine Toru Nakashika (UEC), Tetsuya Takiguchi (Kobe Univ.) SP2015-71 |
In this paper, we argue the way of modelling speech signals using improved three-way restricted Boltzmann machine (3WRBM... [more] |
SP2015-71 pp.7-12 |
SP |
2015-10-16 11:15 |
Hyogo |
Kobe Univ. |
Multi-modal speech recognition using deep bottleneck features Satoshi Tamura (Gifu Univ), Hiroshi Ninomiya (Nagoya Univ), Norihide Kitaoka (Tokushima Univ), Shin Osuga (Aisin Seiki), Yurie Iribe (Aichi Prefectural Univ), Kazuya Takeda (Nagoya Univ), Satoru Hayamizu (Gifu Univ) SP2015-69 |
In this paper, we propose a novel multi-modal speech recognition method which uses speech and lip images, employing Deep... [more] |
SP2015-69 pp.57-62 |
LOIS |
2015-03-06 16:30 |
Okinawa |
|
A Study of Multi-Modal Speech Visualization for Deaf and Hard of Hearing People Support Yusuke Toba, Hiroyasu Horiuchi, Shinsuke Matsumoto, Sachio Saiki, Masahide Nakamura (Kobe Univ.), Tomohito Uchino, Tomohiro Yokoyama, Yasuhiro Takebayashi (School for the Deaf, University of Tsukuba) LOIS2014-94 |
Although deaf and hard of hearing (D/HH) people have various communication ways such as sign language, conversation by w... [more] |
LOIS2014-94 pp.191-196 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 10:45 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Investigation of Deep Neural Network and Cross-adaptation for Voice Activity Detection in Meeting Speech Akihiro Nakadani (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.) SP2014-107 |
In voice activity detection(VAD), performance largely decreases under the influence of noise and reverberation. In this ... [more] |
SP2014-107 pp.19-24 |
MoNA, IPSJ-DPS, IPSJ-MBL |
2014-05-15 11:55 |
Okinawa |
|
Cool Implementation of Voice Recognition System for Web Application Yuichi Maki, Noriyoshi Kamado, Shigeru Fujimura, Yushi Aono, Jyouji Nakayama, Sumitaka Sakauchi, Tomohiro Yamada (NTT) MoNA2014-6 |
We propose a browser-based speech recognition system using HTML5 in a broad sense and report its performance in actual u... [more] |
MoNA2014-6 pp.31-36 |
NC, MBE (Joint) |
2014-03-17 11:00 |
Tokyo |
Tamagawa University |
Correlation between voice signals and alcohol concentrations in blood Masayuki Kawanoi, Naoki Fujiwara, Satoru Kishida (Tottori Univ.) NC2013-117 |
We investigated the correlation between voice signals and alcohol concentrations in blood with neural network systems fo... [more] |
NC2013-117 pp.167-170 |