Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 13:00 |
Online |
Online |
Study on the background cancellation system for speech privacy Jiangning Huang, Akinori Ito (Tohoku Univ.) SP2021-14 |
Evacuation centers at the time of disaster do not have sufficient sound insulation to maintain sound privacy. In this st... [more] |
SP2021-14 pp.57-62 |
WIT |
2020-06-12 13:30 |
Online |
Online |
Improving the pronounce clarity of dysarthric speech using CycleGAN Shuhei Imai, Takashi Nose, Aoi Kanagaki (Tohoku Univ.), Satoshi Watanabe (HTS), Akinori Ito (Tohoku Univ.) WIT2020-1 |
Several voice conversion systems have been developed that converts the dysarthric speech into healthy speech.The convent... [more] |
WIT2020-1 pp.1-6 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2019-12-06 13:55 |
Tokyo |
NHK Science & Technology Research Labs. |
[Poster Presentation]
Analysis and Subjective Labeling for Emotional Speech Database JTES Mai Yamanaka, Takashi Nose, Yuya Chiba, Akinori Ito (Tohoku Univ.) SP2019-39 |
We have constructed JTES, a prosodic balanced emotional speech database containing 50 sentences of 4 emotions of 50 men ... [more] |
SP2019-39 pp.61-66 |
SP |
2019-01-26 16:25 |
Ishikawa |
Kanazawa-Harmonie |
[Fellow Memorial Lecture]
Machine, human and sound communication Akinori Ito (Tohoku Univ.) SP2018-55 |
Speech is the most important modality for human-human communication. From invention of electrical speech communication, ... [more] |
SP2018-55 p.19 |
EA |
2018-10-11 14:30 |
Fukushima |
Iwaki business Innovation Center (Iwaki) |
A study on ship type identification by use of deep neural network Ryouichi Nishimura, Katsuhiro Temma (NICT), Kiyohiko Hattori (Saitama Inst. of Tech.), Kenji Kaneko (TEAMS), Akinori Ito (Tohoku Univ.), Toyonobu Fujii (TEAMS), Akihiro Kijima (Tohoku Univ.) EA2018-54 |
Poaching has recently become a serious problem due to the globalization of food culture and the accompanied rising price... [more] |
EA2018-54 pp.1-6 |
SP, IPSJ-SLP (Joint) |
2018-07-27 10:00 |
Shizuoka |
Sago-Royal-Hotel (Hamamatsu) |
Spoken Term Detection Using Speech Comparator by Machine Learning for Zero-Resource Language Akinori Ito, Masatoshi Koizumi (Tohoku Univ.) SP2018-21 |
In this paper, we propose a spoken term detection method for detection of terms in zero-resource languages. The proposed... [more] |
SP2018-21 pp.25-30 |
SP |
2017-01-21 11:00 |
Tokyo |
The University of Tokyo |
[Poster Presentation]
A Study on Singer-Independent Singing Voice Conversion Using Read Speech Based on Neural Network Harunori Koike, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-67 |
There is a problem that the conventional method requires the speech of the source speaker for training. We proposed a me... [more] |
SP2016-67 pp.17-22 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Improvement of accent sandhi rules based on accent dictionary for Japanese text-to-speech systems Hiroto Aoyama, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-54 |
In order to synthesize more natural speech in Japanese text-to-speech systems, we improved accent sandhi rules. Conventi... [more] |
SP2016-54 pp.31-36 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
F0 control by modeling differential features in DNN-based speech synthesis Shuhei Yamada, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-55 |
We have been developing ``tailor-made speech synthesis,'' a framework which enables users to modify synthetic speech nat... [more] |
SP2016-55 pp.37-42 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Development of the Julius-compatible interface for the speech recognition engine of Kaldi toolkit Yusuke Yamada, Takashi Nose, Yuya Chiba, Akinori Ito (Tohoku Univ.) SP2016-57 |
[more] |
SP2016-57 pp.49-51 |
ITE-ME, IE, EMM, LOIS, IEE-CMN [detail] |
2016-09-16 15:30 |
Aichi |
Aichi Prefectural University |
A Study on Colorization in Photo-Realistic Facial Animation Synthesis from Text Based on HMM and DNN with Animation Unit Kazuki Sato, Takashi Nose, Akinori Ito (Tohoku Univ.) LOIS2016-27 IE2016-64 EMM2016-53 |
We propose a technique for synthesizing photo-realistic facial animation from a text based on hidden Markov model (HMM) ... [more] |
LOIS2016-27 IE2016-64 EMM2016-53 pp.67-72 |
IT, EMM |
2016-05-19 15:10 |
Hokkaido |
Otaru Economic Center |
Study of Photo-realistic Face Moving Image Generation from the Text Using the Facial Feature Kazuki Sato, Takashi Nose, Akinori Ito (Tohoku Univ.) IT2016-8 EMM2016-8 |
In this paper, we propose face moving image synthesis technique based on Hidden Markov model (HMM) using the facial feat... [more] |
IT2016-8 EMM2016-8 pp.43-48 |
EA, EMM |
2015-11-12 15:15 |
Kumamoto |
Kumamoto Univ. |
Facial image conversion based on transformation of Animation Units using DNN Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) EA2015-28 EMM2015-49 |
[more] |
EA2015-28 EMM2015-49 pp.23-28 |
SP |
2015-10-15 13:50 |
Hyogo |
Kobe Univ. |
A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network Harunori Koike, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Tech), Akinori Ito (Tohoku Univ.) SP2015-61 |
In this paper, we propose a novel technique for making the speech individuality of an arbitrary source (input) speaker. ... [more] |
SP2015-61 pp.13-18 |
SP |
2015-10-15 16:45 |
Hyogo |
Kobe Univ. |
A study on quick model training in HMM-based speech synthesis Shuhei Yamada, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2015-64 |
In this paper, we propose an alternative model training technique using speaker-independent monophone models and decisio... [more] |
SP2015-64 pp.27-32 |
SP |
2015-10-15 17:10 |
Hyogo |
Kobe Univ. |
Design and evaluation of prosodically balanced emotion-dependent sentence set based on entropy Emika Takeishi, Takashi Nose, Taketo Kase, Akinori Ito (Tohoku Univ.) SP2015-65 |
We designed an emotional speech database that can be used for emition recognition as well as recognition and synthsis of... [more] |
SP2015-65 pp.33-38 |
SP |
2015-08-21 10:00 |
Iwate |
Iwate Prefectural Univ. |
Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi (NTT), Akinori Ito (Tohoku Universicty) SP2015-50 |
This paper proposes a novel language modeling approach called latent word recurrent neural network language model, which... [more] |
SP2015-50 pp.1-6 |
SP |
2015-08-21 10:25 |
Iwate |
Iwate Prefectural Univ. |
Automatic generation of abbreviated named entities for localized speech recognition Kenta Shiga, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2015-51 |
[more] |
SP2015-51 pp.7-12 |
EMM, IT |
2015-05-21 15:20 |
Kyoto |
Kyoto International Community House |
A study on speaker conversion using speech and expression features for video chatting Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) IT2015-9 EMM2015-9 |
In this paper, we suggest two method that the individuality of the face of original speaker convert that of target speak... [more] |
IT2015-9 EMM2015-9 pp.45-50 |
EMM, EA |
2014-11-20 16:30 |
Fukuoka |
|
Bit-error-tolerant quantizer based on self organizing map Akinori Ito (Tohoku Univ.) EA2014-31 EMM2014-59 |
Bit errors cannot be avoided when communicating using a digital channel. Packet-based communication abodons the packets ... [more] |
EA2014-31 EMM2014-59 pp.19-24 |