Back to Main Conference 2004
LREC 2004main

Multilingual Corpus-based Approach to the Resolution of English –ing

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/3iro6vbtr5wb

Abstract

Corpus data has proven to be useful for dealing with ambiguities in natural language processing (NLP). A number of studies, for example, have dealt with disambiguating English PP attachments, using corpus data. This paper explores a novel approach to resolving ambiguities associated with ing + Noun constructions in English. We use an aligned multilingual (English, Spanish, French, German and Japanese) corpus to extract lexical information necessary for disambiguation. Our premise is that while in English -ing constructions are highly ambiguous, corresponding constructions in other languages may not be ambiguous, and can thus provide English with disambiguating information. We argue that with aligned multilingual corpora, languages can learn non-trivial linguistic information from one another.

Details

Paper ID
lrec2004-main-012
Pages
N/A
BibKey
schwartz-aikawa-2004-multilingual
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • LS

    Lee Schwartz

  • TA

    Takako Aikawa

Links