Presentation 2008-05-23
Phrasal Analysis of Untagged Corpora : Aspects of Japanese and Bulgarian Online Conversation and Behaviour
Milen Martchev,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper describes an N-gram-based approach to interrogating untagged corpora, called Phrasal Analysis, and attempts to explain its value and practicability. Furthermore, N-gram data from Japanese and Bulgarian Internet message boards is used to compare aspects of the language and behaviour of posters in the respective two countries. Contrasted categories include discussional phrases, greetings, time expressions and hyperlinks.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Untagged Corpora / N-grams / Phrasal Analysis / Internet Forums / Message Boards
Paper # TL2008-4
Date of Issue

Conference Information
Committee TL
Conference Date 2008/5/16(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Thought and Language (TL)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Phrasal Analysis of Untagged Corpora : Aspects of Japanese and Bulgarian Online Conversation and Behaviour
Sub Title (in English)
Keyword(1) Untagged Corpora
Keyword(2) N-grams
Keyword(3) Phrasal Analysis
Keyword(4) Internet Forums
Keyword(5) Message Boards
1st Author's Name Milen Martchev
1st Author's Affiliation Graduate School of Social Sciences, Hitotsubashi University()
Date 2008-05-23
Paper # TL2008-4
Volume (vol) vol.108
Number (no) 50
Page pp.pp.-
#Pages 5
Date of Issue