Presentation 2010-05-14
A Method for Generating Concept-base with Considering Co-occurrences between All Words
Katsuji BESSHO, Toshio UCHIYAMA, Tadasu UCHIYAMA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) When generating the concept vector as the meaning representation for the word, we propose a method that the co-occurrences between all words can be considered by allocating a random, unique number set to each word and generating the co-occurrence matrix between words and numbers. The method has the feature that the memory usage for generating and using the concept vectors doesn't increase though information on the co-occurrences between all words is contained. We also propose a method that word concept vectors generated thus are clustered, and the number of the cluster generated as a result is allocated to each word, and then the co-occurrence matrix between words and clusters is generated and united with the co-occurrence matrix between words and numbers. When the accuracy of various linguistic processing was measured by using the concept vector generated with these methods, we confirmed the effectiveness of our method compared with the conventional method.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Concept-base / Co-occurrence Matrix / Clustering
Paper # IE2010-41,PRMU2010-29,MI2010-29
Date of Issue

Conference Information
Committee MI
Conference Date 2010/5/6(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Medical Imaging (MI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A Method for Generating Concept-base with Considering Co-occurrences between All Words
Sub Title (in English)
Keyword(1) Concept-base
Keyword(2) Co-occurrence Matrix
Keyword(3) Clustering
1st Author's Name Katsuji BESSHO
1st Author's Affiliation NTT Cyber Solutions Laboratories, NTT Corporation()
2nd Author's Name Toshio UCHIYAMA
2nd Author's Affiliation NTT Cyber Solutions Laboratories, NTT Corporation
3rd Author's Name Tadasu UCHIYAMA
3rd Author's Affiliation NTT Cyber Solutions Laboratories, NTT Corporation
Date 2010-05-14
Paper # IE2010-41,PRMU2010-29,MI2010-29
Volume (vol) vol.110
Number (no) 28
Page pp.pp.-
#Pages 6
Date of Issue