Title

Constructing Word-Sense Association Networks from Bilingual Dictionary and Comparable Corpora

Author(s)

Hiroyuki Kaji, Osamu Imaichi

Central Research Laboratory, Hitachi, Ltd.

Session

O43-W

Abstract

A novel thesaurus named a gword-sense association networkh is proposed for the first time. It consists of nodes representing word senses, each of which is defined as a set consisting of a word and its translation equivalents, and edges connecting topically associated word senses. This word-sense association network is produced from a bilingual dictionary and comparable corpora by means of a newly developed fully automatic method. The feasibility and effectiveness of the method were demonstrated experimentally by using the EDR English-Japanese dictionary together with Wall Street Journal and Nihon Keizai Shimbun corpora. The word-sense association networks were applied to word-sense disambiguation as well as to a query interface for information retrieval.

Keyword(s)

semantic lexicon, knowledge acquisition, word sense, comparable corpora

Language(s)

English, Japanese

Full Paper

401.pdf