Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP |
2019-01-27 11:05 |
Ishikawa |
Kanazawa-Harmonie |
A Speaker Recognition Performance Measure based on the Adaptation Quickness and Final Accuracy for Spoken Dialog Systems Junko Takami, Takeshi Kawabata (KGU) SP2018-59 |
For constructing user friendly spoken dialog system, it is important to recognize "Who is the user?" and to choose appro... [more] |
SP2018-59 pp.35-40 |
EA, SP, SIP |
2016-03-29 10:45 |
Oita |
Beppu International Convention Center B-ConPlaza |
Tensor-based Speech Representation and its Application to Identification of Languages and Speakers So Suzuki, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2015-127 SIP2015-176 SP2015-155 |
This paper proposes a novel approach to speech representation for automatic identification of languages and speakers by ... [more] |
EA2015-127 SIP2015-176 SP2015-155 pp.341-346 |
PRMU |
2014-03-13 10:00 |
Tokyo |
|
Velocity Pyramid for Event Detection Zhuolin Liang, Nakamasa Inoue, Koichi Shinoda (Tokyo Inst. of Tech.) PRMU2013-170 |
In this paper, we propose a new motion feature, a velocity pyramid, for multimedia event detection. In an event which is... [more] |
PRMU2013-170 pp.13-18 |
SP |
2013-02-28 15:00 |
Aichi |
Daido University |
[Poster Presentation]
Voice femininity estimation for MtF patients using supervectors and SVR Chengshuo Wang, Masayuki Suzuki, Nobuaki Minematsu (Univ. of Tokyo), Kyoko Sakuraba (Dokkyo Medical Univ. Hospital), Keikichi Hirose (Univ. of Tokyo) SP2012-120 |
Femininity estimation of MtF (Male to Female) voices is technically implemented.Speaker characteristics are extracted as... [more] |
SP2012-120 pp.23-24 |
PRMU |
2013-02-22 15:00 |
Osaka |
|
Multimedia event detection using camera motion cancelled features and GMM supervectors Yusuke Kamishima, Nakamasa Inoue, Koichi Shinoda (Tokyo Inst. of Tech.) PRMU2012-170 |
The number of studies for event detection from Internet videos has been increasing. Here, an event is defined as a combi... [more] |
PRMU2012-170 pp.185-190 |
PRMU, SP |
2012-02-10 14:50 |
Miyagi |
|
Event detection from Video using GMM-Supervectors and SVMs Yusuke Kamishima, Nakamasa Inoue, Koichi Shinoda (Tokyo Tech), Shunsuke Sato (Canon) PRMU2011-230 SP2011-145 |
In multimedia event detection, complex target events are detected from a large set of consumer domain videos taken in un... [more] |
PRMU2011-230 SP2011-145 pp.195-200 |
PRMU, FM |
2011-12-16 13:00 |
Shizuoka |
Hamamatsu Campus, Shizuoka Univ. |
[Special Talk]
Toward High-Performance Video Semantic Indexing Nakamasa Inoue, Koichi Shinoda (Tokyo Tech) PRMU2011-140 |
TokyoTech, in collaboration with Canon Inc., achieved the best performance in the Semantic Indexing task of TRECVID2011 ... [more] |
PRMU2011-140 pp.89-94 |
PRMU, DE |
2011-06-07 13:30 |
Kanagawa |
|
Fast Semantic Indexing Using Tree-structured GMMs Nakamasa Inoue, Koichi Shinoda (Tokyo Tech) DE2011-19 PRMU2011-50 |
We propose a fast semantic indexing method for large scale video resources using tree-structured Gaussian mixture models... [more] |
DE2011-19 PRMU2011-50 pp.105-110 |
PRMU |
2011-02-17 13:50 |
Saitama |
|
A Multi-modal, Multi-frame Approach for Semantic Indexing in TRECVID Nakamasa Inoue, Yusuke Kamishima, Koichi Shinoda (Tokyo Tech) PRMU2010-212 |
We propose a multi-modal, multi-frame approach for semantic indexing in the TRECVID 2010 workshop. The goal of the seman... [more] |
PRMU2010-212 pp.25-30 |
IBISML, PRMU, IPSJ-CVIM [detail] |
2010-09-05 09:30 |
Fukuoka |
Fukuoka Univ. |
Multiple Kernel Learning for Generic Object Recognition Using SIFT Gaussian Mixture Models Nakamasa Inoue, Yusuke Kamishima, Koichi Shinoda, Sadaoki Furui (Tokyo Tech) PRMU2010-58 IBISML2010-30 |
We propose a statistical framework for generic object recognition using SIFT Gaussian mixture models (GMMs) and multiple... [more] |
PRMU2010-58 IBISML2010-30 pp.7-12 |