Back to Main Conference 2004
LREC 2004main

Enriching a Thai Lexical Database with Selectional Preferences

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/2p2537km4b46

Abstract

A statistical corpus-based approach for acquiring selectional preferences of verbs is proposed. By parsing through text corpora, we obtain examples of context nouns that are considered to be the selectional preferences of a given verb. The approach is to generalize initial noun classes to the most appropriate levels on a semantic hierarchy. We present an iterative algorithm for generalization by combining an agglomerative merging and a model selection technique called the Bayesian Information Criterion (BIC). In our experiments, we consider the Web as the large corpora. We also propose approaches for extracting examples from the Web. Preliminarily experimental results are given to show the feasibility and effectiveness of our approach.

Details

Paper ID
lrec2004-main-450
Pages
N/A
BibKey
kruengkrai-etal-2004-enriching
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • CK

    Canasai Kruengkrai

  • TC

    Thatsanee Charoenporn

  • VS

    Virach Sornlertlamvanich

  • HI

    Hitoshi Isahara

Links