Presentation | 2010-05-28 Speech communication modeled as timbre modulation and demodulation Nobuaki MINEMATSU |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | How listeners achieve perceptual invariance despite the large acoustic variability of speech is a long-discussed and still open question in speech science and engineering. Recently, we proposed a candidate answer based on mathematically guaranteed relational invariance. Here, completely transform-invariant features, f-divergences, are extracted from the speech dynamics of an input utterance and used to represent that utterance. In this paper, this representation is interpreted from the viewpoints of telecommunications and evolutionary anthropology. Speech production is often regarded as a process of modulating the baseline timbre of a speaker's voice by manipulating the vocal organs, i.e., spectrum modulation. Extraction of the linguistic content from an utterance can then be viewed as a process of spectrum demodulation. This modulation-demodulation model of speech communication links well to known morphological and cognitive differences between humans and apes. The model also claims that linguistic content is transmitted mainly by supra-segmental (prosodic) features. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | (de)modulation / timbre / speech communication / anthropology / autism / supra-segmental features |
Paper # | TL2010-9 |
Date of Issue |
Conference Information | |
Committee | TL |
---|---|
Conference Date | 2010/5/21 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Thought and Language (TL) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Speech communication modeled as timbre modulation and demodulation |
Sub Title (in English) | |
Keyword(1) | (de)modulation |
Keyword(2) | timbre |
Keyword(3) | speech communication |
Keyword(4) | anthropology |
Keyword(5) | autism |
Keyword(6) | supra-segmental features |
1st Author's Name | Nobuaki MINEMATSU |
1st Author's Affiliation | Graduate School of Information Science and Technology, The University of Tokyo |
Date | 2010-05-28 |
Paper # | TL2010-9 |
Volume (vol) | vol.110 |
Number (no) | 63 |
Page | pp.- |
#Pages | 6 |
Date of Issue |