Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SP, SIP |
2016-03-28 15:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Special Invited Talk]
Progress in LPC-based Audio Coders
-- Reduction of quantization distortion by efficient representation of LPC envelope -- Takehiro Moriya, Ryosuke Sugiura, Yutaka Kamamoto, Hirokazu Kameoka, Noboru Harada (NTT) EA2015-99 SIP2015-148 SP2015-127 |
While Linear Predictive Coding (LPC) has been widely used for time-domain speech coding as an essential technology, it h... [more] |
EA2015-99 SIP2015-148 SP2015-127 p.189 |
EA, SP, SIP |
2016-03-29 09:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
Majorisation-minimization based composite autoregressive system optimization with a glottal source model prior Lauri Juvela (Aalto Univ.), Hirokazu Kameoka (Tokyo Univ.), Junichi Yamagishi (NII) EA2015-115 SIP2015-164 SP2015-143 |
The Composite Autoregressive System solves the speech source-filter decomposition problem in a robust manner and can be ... [more] |
EA2015-115 SIP2015-164 SP2015-143 pp.273-278 |
EA, SP, SIP |
2016-03-29 14:40 |
Oita |
Beppu International Convention Center B-ConPlaza |
Product-of-Experts approach to integration of F0 generative process model to statistical F0 prediction for electrolaryngeal speech enhancement Kou Tanaka (NAIST), Hirokazu Kameoka (NTT), Tomoki Toda (The University of Nagoya/NAIST), Satoshi Nakamura (NAIST) EA2015-133 SIP2015-182 SP2015-161 |
We have previously proposed a statistical fundamental frequency (F0) prediction method that makes it possible to predict... [more] |
EA2015-133 SIP2015-182 SP2015-161 pp.373-377 |
SIP, EA, SP |
2015-03-02 09:50 |
Okinawa |
|
Unified approach for BSS, DOA estimation, audio event detection and dereverberation with multichannel factorial HMM and DOA mixture model Takuya Higuchi (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/ NTT) EA2014-74 SIP2014-115 SP2014-137 |
We deal with the problems of blind source separation, dereverberation, audio event detection and DOA estimation. We prev... [more] |
EA2014-74 SIP2014-115 SP2014-137 pp.13-18 |
IBISML |
2014-11-17 17:00 |
Aichi |
Nagoya Univ. |
[Poster Presentation]
Training Algorithm for Restricted Boltzmann Machines Using Auxiliary Function Approach Norihiro Takamune (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT) IBISML2014-56 |
Layerwise pre-training is one of important elements for deep learning, and Restricted Boltzmann Machines (RBMs) is popul... [more] |
IBISML2014-56 pp.161-168 |
IBISML |
2014-11-17 17:00 |
Aichi |
Nagoya Univ. |
[Poster Presentation]
Unified approach for auditory scene analysis based on multichannel factorial hidden Markov model Takuya Higuchi (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT) IBISML2014-57 |
This paper deals with the problems of audio source separation, audio event detection, dereverberation and DOA estimation... [more] |
IBISML2014-57 pp.169-176 |
SP, IPSJ-MUS |
2014-05-24 08:50 |
Tokyo |
|
"Ongaku" Symposium 2014: The 2nd Symposium on Any Topics Related to Acoustics, Audition and Natural Language Hirokazu Kameoka (Univ. of Tokyo/NTT), Eriko Aiba (UEC), Yasunori Ohishi (NTT), Tetsuro Kitahara (Nihon Univ.), Tatsuya Kitamura (Konan Univ.), Shoei Sato (NHK), Masahito Togami (Hitachi), Tomoki Toda (NAIST), Kazuyoshi Yoshii (Kyoto Univ.) SP2014-1 |
[more] |
SP2014-1 pp.1-3 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Underdetermined Blind Separation of Moving Sources Based on Probabilistic Modeling Takuya Higuchi, Norihiro Takamune, Tomohiko Nakamura (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT) SP2014-20 |
This paper deals with the problem of the underdetermined blind separation and tracking of moving sources. In practical s... [more] |
SP2014-20 pp.211-216 |
SP, IPSJ-MUS |
2014-05-25 11:30 |
Tokyo |
|
Text-to-speech prosody synthesis based on probabilistic model for F0 contour Kento Kadowaki, Tatsuma Ishihara, Nobukatsu Hojo (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT) SP2014-28 |
This paper deals with the problem of generating the fundamental frequency (F0) contour of speech from a text input for t... [more] |
SP2014-28 pp.309-314 |
SP |
2014-02-28 13:50 |
Tokushima |
The University of Tokushima |
[Invited Talk]
Non-negative matrix factorization and its applications to time series processing Hirokazu Kameoka (Univ. Tokyo/NTT) SP2013-116 |
In this paper, I will give a brief introduction to a data analysis technique called non-negative matrix factorization (N... [more] |
SP2013-116 pp.31-36 |
SP |
2013-11-21 15:00 |
Nara |
Nara Institute of Science and Technology |
Text-to-speech synthesis based on composite wavelet trajectory model Nobukatsu Hojo, Hirokazu Kameoka, Shigeki Sagayama (Tokyo Univ.) SP2013-73 |
[more] |
SP2013-73 pp.13-18 |
EA |
2012-12-14 14:20 |
Tokyo |
National Institute of Informatics |
[Invited Talk]
Non-negative matrix factorization and its applications to audio signal processing Hirokazu Kameoka (University of Tokyo/NTT) EA2012-118 |
In this paper, I will give a brief introduction to a data analysis technique called non-negative matrix factorization (N... [more] |
EA2012-118 pp.53-58 |
PRMU, NLC |
2012-06-29 16:30 |
Tokyo |
|
A novel extension of hidden Markov models Masahiro Nakano, Yasunori Ohishi, Hirokazu Kameoka, Ryo Mukai, Kunio Kashino (NTT) NLC2012-8 PRMU2012-28 |
This paper discusses a novel extension of hidden Markov models.
We present a Bayesian nonparametric fusion of HMMs and... [more] |
NLC2012-8 PRMU2012-28 pp.31-36 |
PRMU, NLC |
2012-06-29 17:00 |
Tokyo |
|
Bayesian Nonparametric Approach to Audio Event Detection Yasunori Ohishi (NTT), Daichi Mochihashi, Tomoko Matsui (ISM), Masahiro Nakano, Hirokazu Kameoka, Tomonori Izumitani, Kunio Kashino (NTT) NLC2012-9 PRMU2012-29 |
As the amount of available multimedia data increases, the technique to automatically extract the significant information... [more] |
NLC2012-9 PRMU2012-29 pp.37-42 |
PRMU, SP |
2012-02-10 13:40 |
Miyagi |
|
[Invited Talk]
Music Information Processing based on Mathematical Modeling Shigeki Sagayama (The Univ. of Tokyo), Nobutaka Ono (NII), Hirokazu Kameoka (The Univ. of Tokyo/NTT) PRMU2011-229 SP2011-144 |
[more] |
PRMU2011-229 SP2011-144 p.193 |
PRMU, FM |
2010-12-09 09:30 |
Yamaguchi |
|
Automatic Audio Tagging and Retrieval Using Semi-Surpervised Canonical Density Estimation Jun Takagi (Tokyo Tech.), Yasunori Ohishi, Akisato Kimura (NTT), Masashi Sugiyama, Makoto Yamada (Tokyo Tech.), Hirokazu Kameoka (NTT) PRMU2010-126 |
We apply SSCDE (semi-supervised canonical density estimation), asemi-supervised learning method based on topic modeling,... [more] |
PRMU2010-126 pp.1-6 |
SP |
2010-11-18 15:15 |
Aichi |
Aichi Prefectural Univ. |
Statistical speech spectrum model incorporating all-pole vocal tract model and F0 contour generating process model Hirokazu Kameoka (NTT CS Lab.) SP2010-74 |
In this paper, we propose to introduce a statistical speech spectrum model,
simultaneously incorporating the all-pole v... [more] |
SP2010-74 pp.29-34 |
SP |
2010-11-19 10:15 |
Aichi |
Aichi Prefectural Univ. |
Statistical Modeling and Analysis of Singing Voice F0 Dynamics based on Second-order Linear System Yasunori Ohishi, Hirokazu Kameoka, Daichi Mochihashi, Hidehisa Nagano, Kunio Kashino (NTT) SP2010-80 |
We present a new statistical model for dynamics of various singing behaviors, such as vibrato and overshoot, in a fundam... [more] |
SP2010-80 pp.65-70 |
IBISML |
2010-11-04 15:00 |
Tokyo |
IIS, Univ. of Tokyo |
[Poster Presentation]
Hierarchical topic trajectory model for video annotation retrieval considering cross-modal co-occurrences Takuho Nakano (Univ. Tokyo), Akisato Kimura, Hirokazu Kameoka (NTT), Shigeki Miyabe, Shigeki Sagayama, Nobutaka Ono (Univ. Tokyo), Kunio Kashino (NTT), Takuya Nishimoto (Univ. Tokyo) IBISML2010-73 |
This paper deals with a problem of ``video annotation retrieval'' that achieves automatic video annotaion (providing rel... [more] |
IBISML2010-73 pp.105-112 |
EA |
2006-12-15 15:00 |
Kyoto |
NTT Communication Science Laboratories (Keihanna) |
The application of EM algorythm to 2ch BSS based on speech sparseness Yosuke Izumi, Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama (Tokyo Univ.) |
[more] |
EA2006-96 pp.43-48 |