Request Correction
Use this form to request corrections to the paper metadata. Select the fields that need correction and provide the correct information.
Correction Guidelines
- Click the edit button next to a field to report a correction.
- Fill in the suggested correction value for each field you want to correct.
- Provide your name and email so we can contact you if needed.
Paper Information
Word Sense Disambiguation as a Wordnets’ Validation Method in Balkanet
Paper Fields
Click the edit button next to a field to report a correction.
Word Sense Disambiguation as a Wordnets’ Validation Method in Balkanet
BalkaNet is a European project which aims at the development of monolingual wordnets for five languages in the Balkans area (Bulgarian, Greek, Romanian Serbia, and Turkish) and at improvement of the Czech wordnet developed in the EuroWordNet project. The wordnets are aligned to the Princeton Wordnet, according to the principles established by the EuroWordNet consortium. One of the main concerns of this project is the interlingual validation of the wordnets alignment. To this end, we have developed a WSD system, based on parallel corpora, which exploits the common intuition according to which words that are reciprocal translations in a parallel texts should be linked to the same(or closely related) interlingual concepts. An embedded word aligner provides the wordnet-based algorithm, described in the paper, with pairs of words which are reciproca translations and which are subject to mutually disambiguate each other. With wordnets under construction, our WSD system is useful mainly for validation, pinpointing wrong interlingual alignments, incomplete or missing synsets in one or the other of the wordnets. With robust wordnets, the system is a proper word sense disambiguation tool for parallel corpora. The sense granularity at which the WSD is achieved is the one in the Princeton Wordnet. The challenge of this approach, besides its high accuracy and fine-grained disambiguation is that it may be used to automatically sense-tag corpora in not only one language, but rather several at once and by the same sense inventory. WSD is evaluated on an Romanian-English bitext, extracted form the multilingual parallel corpus "1984", against a hand sense-tagging used as a Gold-Standard.
Authors
Expand an author to correct their information. Use the remove button to request author removal, or add a new author.
PDF Attachment
You may attach a PDF as a corrected version of the paper. Max file size: 10MB. Only PDF files are accepted.
Your Information
Author Declaration *
Select at least one field to correct using the edit buttons above.