Presentation | 2012-02-03 An Attempt at Speech Data Collection Using the Social Media "Twitter" Hiroki SHIMADA, Hiromitsu NISHIZAKI, Yoshihiro SEKIGUCHI |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper describes an attempt at collecting speech data with transcriptions using the social media service "Twitter." Statistical processing techniques that use large amounts of speech data have been successful in the speech processing research field. Applying them requires preparing a large amount of speech data in advance, which is costly. We therefore developed a trial Twitter client system with a speech interface. When a user speaks a tweet, it is transcribed by a speech recognition system, and the transcription is presented to the user. The user can then post the transcription together with the speech data to Twitter. Our system thus makes it possible to collect speech data with transcriptions. The alpha version of the system was released in Dec. 2011; at the present time, we have collected 904 tweets (about 38.4 minutes in total, of which 4.4 minutes are speech data with correct transcriptions). |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Twitter / speech data collection / speech interface / speech recognition |
Paper # | NLC2011-61 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2012/1/26 (1 day)
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | An Attempt at Speech Data Collection Using the Social Media "Twitter" |
Sub Title (in English) | |
Keyword(1) | Twitter
Keyword(2) | speech data collection |
Keyword(3) | speech interface |
Keyword(4) | speech recognition |
1st Author's Name | Hiroki SHIMADA |
1st Author's Affiliation | Department of Computer Science and Media Engineering, Faculty of Engineering, University of Yamanashi
2nd Author's Name | Hiromitsu NISHIZAKI |
2nd Author's Affiliation | Interdisciplinary Research Graduate School of Medicine and Engineering, University of Yamanashi
3rd Author's Name | Yoshihiro SEKIGUCHI |
3rd Author's Affiliation | Interdisciplinary Research Graduate School of Medicine and Engineering, University of Yamanashi
Date | 2012-02-03 |
Paper # | NLC2011-61 |
Volume (vol) | vol.111 |
Number (no) | 427 |
Page | pp.-
#Pages | 6 |
Date of Issue |