Back to Main Conference 2004
LREC 2004main

Retrieving Annotated Corpora for Corpus Annotation

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/3uu2ehwnxfkj

Abstract

This paper introduces a tool \Bonsai which supports human in annotating corpora with morphosyntactic information, and in retrieving syntactic structures stored in the database. Integrating annotation and retrieval enables users to annotate a new instance while looking back at the already annotated sentences which share the similar morphosyntactic structure. We focus on the retrieval part of the system, and describe a method to decompose a large input query into smaller ones in order to gain retrieval efficiency. The proposed method is evaluated with the Penn Treebank corpus, showing significant improvements.

Details

Paper ID
lrec2004-main-233
Pages
N/A
BibKey
yoshida-etal-2004-retrieving
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • KY

    Kyôsuke Yoshida

  • TH

    Taiichi Hashimoto

  • TT

    Takenobu Tokunaga

  • HT

    Hozumi Tanaka

Links