Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 17:00 |
Tokyo |
|
A study on modeling phone duration using dynamic features for HMM-based speech synthesis Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-55 SP2011-100 |
This paper proposes a technique for modeling and generating phone durations
using their dynamic features to improve pre... [more] |
NLC2011-55 SP2011-100 pp.197-202 |
WIT, SP |
2011-10-07 15:45 |
Tokyo |
TFT Bldg. |
A Study on Switch Method to Suppress the Background Sound using Speech Segments Detection Tomoyasu Komori (NHK ES), Atsushi Imai, Nobumasa Seiyama, Reiko Takou, Toru Imai, Shinichi Homma (NHK), Tohru Takagi (NHK ES) SP2011-66 WIT2011-48 |
(To be available after the conference date) [more] |
SP2011-66 WIT2011-48 pp.77-82 |
SP |
2011-03-04 17:00 |
Tokyo |
Faculty of Engineering, The University of Tokyo |
The Phonemic Errors, Syntax Errors, and Semantic Errors
-- The Analyses of Phenomena in Pronunciation and Hearing of Foreign Languages -- Kimiaki Tokumaru (System Engineer) SP2010-120 |
Human Digital Language System consists of three automata, namely: "Vocalization of Doubly Articulated Messages", "Error ... [more] |
SP2010-120 pp.31-36 |
NLC, SP (Joint) [detail] |
2010-12-21 16:40 |
Tokyo |
National Olympics Memorial Youth Center |
Study on HMM-based F0 Coding for Very Low Bit-Rate Vocoder Takashi Nose, Masashi Kumamoto, Takao Kobayashi (Tokyo Inst. of Tech.) NLC2010-28 SP2010-101 |
This paper presents a novel F0 coding technique for very low bit-rate HMM-based phonetic vocoder. Our technique is based... [more] |
NLC2010-28 SP2010-101 pp.189-194 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
A study on Voice Conversion Based on F0 Quantization and Non-parallel Training Yuhei Ota, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) NLC2009-27 SP2009-91 |
This paper presents a novel voice conversion technique using HMM-based phoneme recognition and speech synthesis with non... [more] |
NLC2009-27 SP2009-91 pp.171-176 |
EA |
2009-06-25 16:30 |
Hokkaido |
|
Design of Phoneme MIDI Codes Using the MIDI Encoding Tool "Auto-F" and Realizing Voice Synthesizing Functions Based on Musical Sounds Toshio Modegi (Dai Nippon Printing Co., Ltd.) EA2009-24 |
Using our previously developed audio to MIDI code converter tool “Auto-F”, from given vocal acoustic signals we can crea... [more] |
EA2009-24 pp.37-42 |
PRMU, IE, MI |
2009-05-28 15:00 |
Gifu |
Gifu Univ. |
Kansei Information Retrieval from Web Video Repository Yuichi Yaguchi (Univ. of Aizu/JSPS Research Fellw), Marie Oshima, Takashi Kawai, Ryuichi Oka (Univ. of Aizu) IE2009-20 PRMU2009-11 MI2009-11 |
This paper proposes and reports pilot study of Kansei information retrieval using Web based speech retrieval system for ... [more] |
IE2009-20 PRMU2009-11 MI2009-11 pp.59-64 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Articulatory feature extraction based on 3-stage MLNs and Inhibition/Enhancement Network Mohammad Nurul Huda, Hiroaki Kawashima, Tsuneo Nitta (Toyohashi Univ. of Tech.) NLC2008-49 SP2008-104 |
This paper describes an articulatory feature extractor for accurate phoneme recognition. The proposed method comprises t... [more] |
NLC2008-49 SP2008-104 pp.149-154 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Study on Spectro-Temporal Features Based on Gradient Histograms Takashi Muroi, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) NLC2008-51 SP2008-106 |
This paper proposes a novel feature extraction method for speech recognition based on gradient features on 2-D time-freq... [more] |
NLC2008-51 SP2008-106 pp.161-165 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Speech Recognition Based on Statistical Models Including Multiple Decision Trees Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Institute of Technology) NLC2008-61 SP2008-116 |
This paper proposes a speech recoginition technique using multiple decision trees. In the use of context dependent model... [more] |
NLC2008-61 SP2008-116 pp.221-226 |
TL |
2008-08-08 16:15 |
Miyagi |
Hotel Onikobe |
Effects of English experiences on preschoolers' brain activities in listening to /ra/ and /la/ contrast
-- A NIRS study -- Akitsugu Konno, Suguru Maruyama (Tohoku Univ./JST), Souta Hidaka (Tohoku Univ.), Akihiro Tanaka (Tilburg Univ.), Masatoshi Koizumi, Jiro Gyoba (Tohoku Univ./JST), Hiroko Hagiwara (JST/Tokyo Metropolitan Univ.) TL2008-20 |
We observed the preschoolers’ brain activities in listening to /ra/ and /la/ phonemic contrast by using near-infrared sp... [more] |
TL2008-20 pp.45-48 |
SP |
2008-07-17 - 2008-07-19 |
Iwate |
Iwate Prefectural Univ. |
[Fellow Memorial Lecture]
Speech recognition research past and present Shozo Makino (Tohoku Univ.) SP2008-50 |
I have engaged in research of speech recognition from 1968. The approaches of our speech recognition research group abou... [more] |
SP2008-50 pp.43-48 |
NC, MBE (Joint) |
2008-03-14 11:10 |
Tokyo |
Tamagawa Univ |
Hierarchical Cluster Analysis of Phoneme Data Using the RDSP Method Yuta Onoshiro, Kazunori Iwata, Nobuo Suematsu, Akira Hayashi (Hiroshima City Univ.) NC2007-177 |
The RDSP method is an effective method in hierarchical cluster analysis
when the population of samples is characterize... [more] |
NC2007-177 pp.385-390 |
SP |
|
Toyama |
Toyama Prefectural University |
Acoustic Modeling Based on Model Structure Annealing for Speech Recognition Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nitech) SP2007-35 |
This paper proposes a joint optimization technique of phonetic decision trees and state sequences for HMM-based speech r... [more] |
SP2007-35 pp.67-72 |
SP |
2007-06-28 15:15 |
Fukushima |
The University of Aizu |
Detection of acoustic event boundaries from continuous speech based on constrained clustering Naoya Shimomura, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2007-12 |
Speech databases with accurate phoneme labeling are often required to improve speech technologies. Although HMM-based fo... [more] |
SP2007-12 pp.25-30 |
SP |
2007-06-29 10:50 |
Fukushima |
The University of Aizu |
An investigation on speaker vector-based speaker identification under noisy conditions Yuki Goto, Tatsuya Akatsu, Masaharu Katoh, Tetsuo Kosaka, Masaki Kohda (Yamagata Univ.) SP2007-18 |
This paper describes the experimental results of the speaker identification method based on a speaker vector under noisy... [more] |
SP2007-18 pp.61-66 |
SP |
2007-05-31 10:00 |
Kyoto |
ATR |
Feature Extraction Using Phoneme PCA for Reverberant Speech Recognition Hyunsin Park, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2007-1 |
In this paper, we propose a novel subspace approach for robust speech feature extraction in reverberant environments usi... [more] |
SP2007-1 pp.1-6 |
PRMU |
2007-03-16 09:00 |
Okayama |
|
Semantic news video indexing using the Web Hidenobu Nagata, Masashi Morimoto (NTT) |
This paper presents a novel methodology of semantic
video indexing for broadcasts using the Web. Focusing
on the co-... [more] |
PRMU2006-258 pp.19-24 |
NC |
2006-01-23 11:00 |
Hokkaido |
Hokkaido Univ. |
the McGurk effect on the self organizing maps Kei Omata (Titech), Ken Mogi (Sony CSL) |
Visual articulations interfere with auditory perception, as is manifest in the McGurk effect. In order to investigate th... [more] |
NC2005-92 pp.13-18 |