ken-system: Advance Program - 2014-05-SP-IPSJ-MUS

IEICE Technical Committee Submission System
Advance Program

Technical Committee on Speech (SP)

Chair		Takeshi Kawabata (Kwansei Gakuin Univ.)
Vice Chair		Hisashi Kawai (KDDI Labs.)
Secretary		Motoyuki Suzuki (Osaka Inst. of Tech.), Tomoki Toda (NAIST)
Assistant		Yamato Ohtani (Toshiba), Takanobu Oba (NTT)

Special Interest Group on Music and Computer (IPSJ-MUS)

Chair		Rumi Hiraga
Secretary		Tetsuro Kitahara, Tetsuaki Baba, Keiji Hirata, Kazuyoshi Yoshii, Hirokazu Kameoka

Conference Date	Sat, May 24, 2014 08:50 - 17:45 / Sun, May 25, 2014 09:00 - 18:00
Topics
Conference Place

	-
Sat, May 24 AM 08:50 - 09:00
(1) SP	08:50-09:00	"Ongaku" Symposium 2014: The 2nd Symposium on Any Topics Related to Acoustics, Audition and Natural Language	Hirokazu Kameoka (Univ. of Tokyo/NTT), Eriko Aiba (UEC), Yasunori Ohishi (NTT), Tetsuro Kitahara (Nihon Univ.), Tatsuya Kitamura (Konan Univ.), Shoei Sato (NHK), Masahito Togami (Hitachi), Tomoki Toda (NAIST), Kazuyoshi Yoshii (Kyoto Univ.)
Sat, May 24 AM 09:00 - 09:45
(2) SP	09:00-09:45	[Invited Talk] Speaker adaptation technologies for speech synthesis and its application to assistive technology	Junichi Yamagishi (NII)
Sat, May 24 AM 09:45 - 10:30
(3)	09:45-10:30
Sat, May 24 AM 10:30 - 11:15
(4) SP	10:30-11:15	[Invited Talk] Infinite data analysis and Bayesian nonparametrics for audio signal processing	Masahiro Nakano (NTT)
Sat, May 24 AM 11:15 - 15:30
	-
	-
Sat, May 24 PM 15:30 - 16:15
(5) SP	15:30-16:15	[Invited Talk] From multimodal spatial hearing to engineering applications to cope with severe disasters -- Our recent research results on spatial acoustic information sciences --	Yo-iti Suzuki, Shuichi Sakamoto (Tohoku Univ.)
Sat, May 24 PM 16:15 - 17:00
(6)	16:15-17:00
Sat, May 24 PM 17:00 - 17:45
(7)	17:00-17:45
	-
	-
	-
Sun, May 25 AM 09:00 - 09:45
(8) SP	09:00-09:45	[Invited Talk] Behavioral neurosciences of vocal control and learning -- using the songbird as a model system --	Ryosuke O. Tachibana (Univ. of Tokyo)
Sun, May 25 AM 09:45 - 10:30
(9) SP	09:45-10:30	[Invited Talk] Machine Translation -- Why couldn't we do it? Why are we starting to be able to now? --	Graham Neubig (NAIST)
Sun, May 25 AM 10:30 - 11:15
(10) SP	10:30-11:15	[Invited Talk] Applications and Advances of Deep Learning for Automatic Speech Recognition	Yotaro Kubo (Amazon)
Sun, May 25 AM 11:15 - 15:30
	-
	-
Sun, May 25 PM 15:30 - 16:15
(11) SP	15:30-16:15	[Invited Talk] R&D of Music Information Retrieval Technology and Issues for its Deployment to Practical Applications	Keiichiro Hoashi (KDDI Labs)
Sun, May 25 PM 16:15 - 17:00
(12) SP	16:15-17:00	[Invited Talk] What Higher-Order Statistics Tell Us? -- Acoustic Signal Processing Based on Unsupervised Learning --	Hiroshi Saruwatari (Univ. of Tokyo)
Sun, May 25 PM 17:00 - 17:45
(13)	17:00-17:45
Sun, May 25 PM 17:45 - 18:00
	-
	-
Sat, May 24 AM 11:30 - 15:30
(14)	11:30-15:30
(15)	11:30-15:30
(16)	11:30-15:30
(17)	11:30-15:30
(18)	11:30-15:30
(19)	11:30-15:30
(20)	11:30-15:30
(21)	11:30-15:30
(22)	11:30-15:30
(23)	11:30-15:30
(24)	11:30-15:30
(25) SP	11:30-15:30	A Consideration of Evaluation Measurements in Spoken Term Detection	Satoshi Oshima, Yoshiaki Itoh (Iwate Prefectural Univ.)
(26) SP	11:30-15:30	Robustness of Speaker Identification Using Pseudo Pitch Synchronized Phase Information	Yuta Kawakami, Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.), Seiichi Nakagawa (Toyohashi Univ. of Tech.)
(27) SP	11:30-15:30	Visualization of World Englishes pronunciations from a speaker's self-centered viewpoint using attributes of accent, gender, and age	Yuji Kawase, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (UTokyo), Han-Ping Shen (NCKU)
(28)	11:30-15:30
(29) SP	11:30-15:30	Native language recognition using machine learning	Ryota Sakagami, Kouki Takeshita, Longbiao Wang, Masahiro Iwahashi (Nagaoka Univ. of Tech.)
(30) SP	11:30-15:30	Language recognition in reverberant environments	Kouki Takeshita, Ryota Sakagami, Longbiao Wang, Masahiro Iwahashi (Nagaoka Univ. of Tech.)
(31) SP	11:30-15:30	Discriminative training of acoustic models for system combination	Yuuki Tachioka (Mitsubishi Electric), Shinji Watanabe, Jonathan Le Roux, John R. Hershey (MERL)
(32) SP	11:30-15:30	Distant-talking Speech Recognition with Asynchronous Speech Recording	Shunta Teraoka, Yuma Ueda (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai, Taku Fukushima (Shizuoka Univ.)
(33)	11:30-15:30
(34)	11:30-15:30
(35) SP	11:30-15:30	[Research Introduction] A spectrogram-patch-input DNN model for detection and classification of acoustic events robust to speech overlapping scenarios	Miquel Espi, Masakiyo Fujimoto, Yotaro Kubo, Tomohiro Nakatani (NTT)
(36) SP	11:30-15:30	Development of environmental sound collection system using smart devices based on crowd-sourcing approach	Sunao Hara, Akinori Kasai, Masanobu Abe (Okayama Univ.), Noboru Sonehara (NII)
(37) SP	11:30-15:30	ROCKON: Environmental sound collection and recognition system using smartphones	Minori Matsuyama, Takahiko Tsuda, Ryuichi Nisimura, Hideki Kawahara (Wakayama Univ.), Junnosuke Yamada (NTT), Toshio Irino (Wakayama Univ.)
(38)	11:30-15:30
(39)	11:30-15:30
(40)	11:30-15:30
(41)	11:30-15:30
(42) SP	11:30-15:30	Underdetermined Blind Separation of Moving Sources Based on Probabilistic Modeling	Takuya Higuchi, Norihiro Takamune, Tomohiko Nakamura (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT)
(43) SP	11:30-15:30	Psychometric functions for across-frequency gap detection	Yousuke Kikuchi, Takako Mitsudo, Nobuyuki Hirose, Shuji Mori (Kyushu Univ.)
(44) SP	11:30-15:30	Deriving the Salience Level of a Target Sound using a Tapping Technique Method	Shunsuke Kidani, Hsin-I Liao, Makoto Yoneya, Makio Kashino, Shigeto Furukawa (NTT)
(45) SP	11:30-15:30	Perception of stop consonants at the beginning of binaurally fused words	Hitomi Kondo, Yousuke Kikuchi, Takako Mitsudo, Nobuyuki Hirose, Shuji Mori (Kyushu Univ.)
(46) SP	11:30-15:30	Effect of interaural time difference for localization of spatially segregated sound	Daisuke Morikawa (JAIST)
(47) SP	11:30-15:30	Acquisition and retention of perceptual cue for size judgment using whispered speech	Koudai Yamamoto, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara (Wakayama Univ.)
Sun, May 25 AM 11:30 - 15:30
(48)	11:30-15:30
(49)	11:30-15:30
(50)	11:30-15:30
(51)	11:30-15:30
(52)	11:30-15:30
(53)	11:30-15:30
(54)	11:30-15:30
(55)	11:30-15:30
(56)	11:30-15:30
(57)	11:30-15:30
(58) SP	11:30-15:30	Analysis of the Relationship between Pitch and Formant Frequencies in Voice Register Transition	Yasufumi Uezu, Takahiro Furukawa, Tokihiko Kaburagi (Kyushu Univ.)
(59) SP	11:30-15:30	Statistical bandwidth extension using sub-band basis spectrum model	Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine (Toshiba)
(60) SP	11:30-15:30	Text-to-speech prosody synthesis based on probabilistic model for F0 contour	Kento Kadowaki, Tatsuma Ishihara, Nobukatsu Hojo (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT)
(61) SP	11:30-15:30	Evaluation of singing voice similarity based on "acoustic singing-structure"	Shun Kojima, Takeshi Saitou, Masato Miyoshi (Kanazawa Univ.)
(62) SP	11:30-15:30	Statistical approach to perceived age control of singing voice	Kazuhiro Kobayashi, Tomoki Toda (NAIST), Tomoyasu Nakano, Masataka Goto (AIST), Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST)
(63) SP	11:30-15:30	A portable application for assistance of vocal sound training by overtone analysis	Iori Sugahara, Takayuki Itoh (Ochanomizu Univ.)
(64) SP	11:30-15:30	An Evaluation of a Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Prediction	Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST)
(65) SP	11:30-15:30	Design of voice-enabled web test system for eliminating users' impatience	Chihiro Tafuji, Ryuichi Nisimura, Hideki Kawahara, Toshio Irino (Wakayama Univ.)
(66) SP	11:30-15:30	A joint restricted Boltzmann machine for dictionary learning in sparse-representation-based voice conversion	Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.)
(67) SP	11:30-15:30	Speech waveform generation on subband domain	Nobuyuki Nishizawa, Tsuneo Kato (KDDI R&D Labs)
(68) SP	11:30-15:30	A Kana Protocol Recommendation Method for Switch Input Speech Synthesis Systems	Fuming Fang, Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech)
(69) SP	11:30-15:30	Current situations and issues of open-source high-quality speech synthesis system WORLD	Masanori Morise (Univ. of Yamanashi)
(70) SP	11:30-15:30	The Acoustic Feature of the Loudspeaker which used the Reinforced Corrugated Fibreboard for the Enclosure Material	Takuto Isoyama, Yukio Mori (Salesian Polytechnic), Yoshiaki Kiyama
(71) SP	11:30-15:30	Spot-forming method by using two shotgun microphones	Motoyuki Suzuki, Takeshi Honjo (Osaka Inst. of Tech.)
(72) SP	11:30-15:30	Signal processing of ultrasound for osteoporosis diagnosis -- Modeling, time domain analysis, and frequency domain analysis --	Yoshiki Nagatani (KCCT), Ryosuke O. Tachibana (Univ. of Tokyo)
(73) SP	11:30-15:30	Modulation transfer function based robust method of voice activity detection for noisy reverberant environments -- Utilization of subband SNR estimation --	Shota Morita, Masashi Unoki (JAIST), Xugang Lu (NICT), Masato Akagi (JAIST)
(74) SP	11:30-15:30	Systematic study on kawaii products (The seventeenth report) -- Basic study for Kawaii sound --	Michiko Ohkura, Ryo Kanno (Shibaura Inst. Tech.)
(75) SP	11:30-15:30	The basic mechanisms for perception of simultaneity, stream segregation, and temporal order for auditory stimuli	Satoshi Okazaki, Makoto Ichikawa (Chiba Univ.)
(76)	11:30-15:30
(77)	11:30-15:30
(78) SP	11:30-15:30	[Research Introduction] Adaptive adjustment of local temporal structure in song of Bengalese finches	Ryosuke O. Tachibana, Neal A. Hessler, Kazuo Okanoya (Univ. of Tokyo)
(79) SP	11:30-15:30	Modulation of the Temporal Dynamics of Microsaccades with the Presentation of Salient Sounds	Makoto Yoneya, Hsin-I Liao, Shunsuke Kidani, Shigeto Furukawa (NTT), Makio Kashino (NTT/Tokyo Tech)

Announcement for Speakers
Invited Talk	Each talk is allotted 35 minutes for presentation and 10 minutes for discussion.
Poster Presentation	Each poster is allotted 225 minutes for presentation.

Contact Address and Latest Schedule Information
SP	Technical Committee on Speech (SP) [Latest Schedule]
	Contact Address
IPSJ-MUS	Special Interest Group on Music and Computer (IPSJ-MUS) [Latest Schedule]
	Contact Address

Last modified: 2014-05-23 15:06:55

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan