Presentation 2012-02-03
An Attempt at Speech Data Collection Using the Social Media "Twitter"
Hiroki SHIMADA, Hiromitsu NISHIZAKI, Yoshihiro SEKIGUCHI
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper describes an attempt to collect speech data with transcriptions using the social media service "Twitter." Statistical processing techniques that rely on large amounts of speech data have been successful in speech processing research, but such data must be prepared in advance, and doing so is costly. We therefore developed a trial Twitter client with a speech interface. When a user speaks a tweet, a speech recognition system transcribes it and presents the transcription to the user, who can then post the transcription together with the speech data to Twitter. Using the system thus makes it possible to collect speech data with transcriptions. The alpha version of the system was released in December 2011; so far, we have collected 904 tweets (about 38.4 minutes of speech, of which 4.4 minutes have correct transcriptions).
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Twitter / speech data collection / speech interface / speech recognition
Paper # NLC2011-61
Date of Issue

Conference Information
Committee NLC
Conference Date 2012/1/26 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) An Attempt at Speech Data Collection Using the Social Media "Twitter"
Sub Title (in English)
Keyword(1) Twitter
Keyword(2) speech data collection
Keyword(3) speech interface
Keyword(4) speech recognition
1st Author's Name Hiroki SHIMADA
1st Author's Affiliation Department of Computer Science and Media Engineering, Faculty of Engineering, University of Yamanashi
2nd Author's Name Hiromitsu NISHIZAKI
2nd Author's Affiliation Interdisciplinary Research Graduate School of Medicine and Engineering, University of Yamanashi
3rd Author's Name Yoshihiro SEKIGUCHI
3rd Author's Affiliation Interdisciplinary Research Graduate School of Medicine and Engineering, University of Yamanashi
Date 2012-02-03
Paper # NLC2011-61
Volume (vol) vol.111
Number (no) 427
Page pp.-
#Pages 6