ソーシャルメディア「Twitter」を利用した音声データ収集の試み(第3回集合知シンポジウム)

Presentation	2012-02-03 An Attempt at Speech Data Collection Using the Social Media "Twitter" Hiroki SHIMADA, Hiromitsu NISHIZAKI, Yoshihiro SEKIGUCHI,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper describes an attempt at speech data collection with transcription using the social media "Twitter." Statistical processing techniques using a large amount of speech data are successful in the speech processing research field. It is necessary to prepare a large amount of speech data to perform them in advance. However, it needs the high cost. Therefore, we developed a trial twitter client system with speech interface. When a user tweet, it is transcribed by a speech recognition system, then the transcription is provided to the user. The user can post the transcription with speech data to Twitter. Using our system makes it possible to collect speech data with transcription. The alpha version of system was released on Dec. 2011, at the present time, we collected 904 tweets (about 38.4 minutes, 4.4 minutes for speech data with collect transcriptions).
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Twitter / speech data collection / speech interface / speech recognition
Paper #	NLC2011-61
Date of Issue

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	An Attempt at Speech Data Collection Using the Social Media "Twitter"
Sub Title (in English)
Keyword(1)	Twitter
Keyword(2)	speech data collection
Keyword(3)	speech interface
Keyword(4)	speech recognition
1st Author's Name	Hiroki SHIMADA
1st Author's Affiliation	Department of Computer Sience and Media Engineering, Faculty of Engineering, University of Yamanashi()
2nd Author's Name	Hiromitsu NISHIZAKI
2nd Author's Affiliation	Interdispilinary Research Graduate School of Medicin and Engineering, University of Yamanashi
3rd Author's Name	Yoshihiro SEKIGUCHI
3rd Author's Affiliation	Interdispilinary Research Graduate School of Medicin and Engineering, University of Yamanashi
Date	2012-02-03
Paper #	NLC2011-61
Volume (vol)	vol.111
Number (no)	427
Page	pp.pp.-
#Pages	6
Date of Issue