Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2024-06-15 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Discussion toward Accurate Analysis of Speech Signals Shigeki Sagayama (UT/UEC) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 16:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Multiple Lag Window Pairs for Estimation of Fundamental Frequency and Periodicity Measure Michiki Koshimori (UEC), Shigeki Sagayama (UTokyo/UEC), Toru Nakashika (UEC) EA2023-75 SIP2023-122 SP2023-57 |
Extending the main concept of the modified autocorrelation method in LPC, we investigate lag windows, lag window pairs, and ... [more] |
EA2023-75 SIP2023-122 SP2023-57 pp.85-90 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Short Paper]
SBERT-based Musical Components Estimation from Lyrics Trained with Imbalanced "Orpheus" Data Mastuti Puspitasari, Takuya Takahashi (UEC), Gen Hori (AU), Shigeki Sagayama, Toru Nakashika (UEC) SP2023-18 |
This research was done to develop neural models that are capable of estimating appropriate musical components based on l... [more] |
SP2023-18 pp.86-90 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Non-chord Tone Data Collection for Music Analysis and Generation Takuya Takahashi, Toru Nakashika, Shigeki Sagayama (UEC) SP2023-20 |
Non-chord tones are one of the components of harmony theory and play an important role in music analysis and composi... [more] |
SP2023-20 pp.97-102 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 15:00 |
Online |
Online |
Improved speech analysis using F0-adaptive lag window Michiki Koshimori, Shigeki Sagayama, Takuya Kishida, Toru Nakashika (UEC) SP2022-21 |
The lag window method is based on a source-filter model, which separates the source information from the filter informat... [more] |
SP2022-21 pp.90-93 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 15:00 |
Online |
Online |
VAE-VC based on cross-entropy error minimization of LSP frequency intervals Yoshihiro Hiramoto, Shigeki Sagayama, Takuya Kishida, Toru Nakashika (UEC) SP2022-23 |
[more] |
SP2022-23 pp.100-103 |
WIT, ASJ-H |
2019-02-07 14:45 |
Ehime |
Ehime Univ. |
Toward Automatic Differentiation of Autism Spectrum Disorder Focusing on Dialogue Speech Features Daiki Mitsumoto (Meiji Univ.), Hidenori Yamasue (Hamamatsu Univ. School of Medicine), Keiho Owada, Masaki Kojima (Univ. of Tokyo), Keiko Ochi (TUT), Nobutaka Ono (TMU), Takeshi Hori, Shigeki Sagayama (Meiji Univ.) WIT2018-52 |
[more] |
WIT2018-52 pp.17-22 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 13:00 |
Okinawa |
|
[Poster Presentation]
Acoustic analysis of speech for emergency speech detection of voicemail Matsuto Hori (Meiji Univ.), Hosana Kamiyama, Satoshi Kobashikawa (NTT), Shigeki Sagayama (Meiji Univ.) EA2017-112 SIP2017-121 SP2017-95 |
In this paper, we investigate the effectiveness of acoustic features for detecting urgency in voicemail. In previous res... [more] |
EA2017-112 SIP2017-121 SP2017-95 pp.63-68 |
PRMU, IE, MI, SIP |
2017-05-25 15:50 |
Aichi |
|
[Special Talk]
How to Enjoy Music Information Processing Research Shigeki Sagayama (Meiji Univ.) |
[more] |
|
SP |
2013-11-21 15:00 |
Nara |
Nara Institute of Science and Technology |
Text-to-speech synthesis based on composite wavelet trajectory model Nobukatsu Hojo, Hirokazu Kameoka, Shigeki Sagayama (Tokyo Univ.) SP2013-73 |
[more] |
SP2013-73 pp.13-18 |
SP, EA, SIP |
2013-05-17 13:25 |
Okayama |
|
Online independent vector analysis with incremental updates of weighted covariance Toru Taniguchi (Toshiba), Nobutaka Ono (NII), Akinori Kawamura (Toshiba), Shigeki Sagayama (NII) EA2013-21 SIP2013-21 SP2013-21 |
We proposed a method of online independent vector analysis based on an auxiliary-function approach, proposed by N. Ono,... [more] |
EA2013-21 SIP2013-21 SP2013-21 pp.121-126 |
EA, EMM |
2012-11-16 12:10 |
Oita |
OITA Univ. |
Auxiliary-function-based independent vector analysis with non-speech frame information for speech enhancement Masataka Suzuki (Univ. of Tokyo), Nobutaka Ono (NII), Toru Taniguchi, Masaru Sakai, Akinori Kawamura (Toshiba Corp.), Miquel Espi, Shigeki Sagayama (Univ. of Tokyo) EA2012-87 EMM2012-69 |
In this study, we discuss a technique to enhance the speech of interest in a noisy environment using microphone a... [more] |
EA2012-87 EMM2012-69 pp.35-38 |
PRMU, SP |
2012-02-10 13:40 |
Miyagi |
|
[Invited Talk]
Music Information Processing based on Mathematical Modeling Shigeki Sagayama (The Univ. of Tokyo), Nobutaka Ono (NII), Hirokazu Kameoka (The Univ. of Tokyo/NTT) PRMU2011-229 SP2011-144 |
[more] |
PRMU2011-229 SP2011-144 p.193 |
PRMU |
2011-03-10 11:10 |
Ibaraki |
|
Simultaneous Optimization of Context Clustering and GMM for Offline Handwritten Word Recognition Using HMM Tomoyuki Hamamura (Toshiba/Univ. of Tokyo), Bunpei Irie (Toshiba), Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama (Univ. of Tokyo) PRMU2010-244 |
Context-dependent HMM is commonly used in speech recognition. The model can be realized in two ways: context clustering ... [more] |
PRMU2010-244 pp.43-48 |
SP |
2011-03-05 13:30 |
Tokyo |
Faculty of Engineering, The University of Tokyo |
Decision of Feedback Timing for Speech Recognition with Reinforcement Learning Di Lu, Satoru Fukayama, Takuya Nishimoto, Shigeki Sagayama (Univ. of Tokyo) SP2010-125 |
In spoken dialog systems, it is important to reduce the delay of the response to the user's utterance. We investigated t... [more] |
SP2010-125 pp.61-66 |
SP |
2011-03-05 16:15 |
Tokyo |
Faculty of Engineering, The University of Tokyo |
Speech Enhancement Based on Frequency-Fluctuation-Length Filter Hideyuki Tachibana, Nobutaka Ono, Shigeki Sagayama (Univ. of Tokyo) SP2010-130 |
In this paper, we describe a novel speech enhancement technique based on fluctuation lengths of speech signals. Fluctuat... [more] |
SP2010-130 pp.89-94 |
EA |
2010-12-10 13:20 |
Ibaraki |
Univ. of Tsukuba |
Blind source separation with distributed microphone pairs and investigation of performance gain using post-filter Takuma Ono, Nobutaka Ito, Shigeki Miyabe, Nobutaka Ono, Shigeki Sagayama (Tokyo Univ.) EA2010-101 |
In this paper, we discuss microphone location and the separation method using frequency-domain independent component ana... [more] |
EA2010-101 pp.25-30 |
EA |
2010-12-10 13:45 |
Ibaraki |
Univ. of Tsukuba |
Diffuse noise robust multiple source localization based on noise reduction in covariance matrix domain Nobutaka Ito (Univ. Tokyo), Emmanuel Vincent (INRIA Rennes), Nobutaka Ono (Univ. Tokyo), Remi Gribonval (INRIA Rennes), Shigeki Sagayama (Univ. Tokyo) EA2010-102 |
In this paper, we propose a method for estimating the azimuths of multiple sound sources accurately even in the presence... [more] |
EA2010-102 pp.31-36 |
IBISML |
2010-11-04 15:00 |
Tokyo |
IIS, Univ. of Tokyo |
[Poster Presentation]
Hierarchical topic trajectory model for video annotation retrieval considering cross-modal co-occurrences Takuho Nakano (Univ. Tokyo), Akisato Kimura, Hirokazu Kameoka (NTT), Shigeki Miyabe, Shigeki Sagayama, Nobutaka Ono (Univ. Tokyo), Kunio Kashino (NTT), Takuya Nishimoto (Univ. Tokyo) IBISML2010-73 |
This paper deals with a problem of "video annotation retrieval" that achieves automatic video annotation (providing rel... [more] |
IBISML2010-73 pp.105-112 |
SP, NLC |
2008-12-09 10:00 |
Tokyo |
Waseda Univ. |
Two-channel input speech recognition using sparseness-based blind source separation Kenta Nishiki, Yosuke Izumi (Univ. of Tokyo), Shinji Watanabe (NTT), Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama (Univ. of Tokyo) NLC2008-24 SP2008-79 |
This paper discusses two-channel input speech recognition using sparseness-based blind source separation. The target ... [more] |
NLC2008-24 SP2008-79 pp.1-6 |