Back to Main Conference 2010
LREC 2010main

Active Learning for Building a Corpus of Questions for Parsing

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)

DOI:10.63317/3mqvuq6fnms9

Abstract

This paper describes how we built a dependency Treebank for questions. The questions for the Treebank were drawn from questions from the TREC 10 QA task and from Yahoo! Answers. Among the uses for the corpus is to train a dependency parser achieving good accuracy on parsing questions without hurting its overall accuracy. We also explore active learning techniques to determine the suitable size for a corpus of questions in order to achieve adequate accuracy while minimizing the annotation efforts.

Details

Paper ID
lrec2010-main-447
Pages
N/A
BibKey
atserias-etal-2010-active
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-6-7
Conference
Seventh International Conference on Language Resources and Evaluation
Location
Valletta, Malta
Date
17 May 2010 23 May 2010

Authors

  • JA

    Jordi Atserias

  • GA

    Giuseppe Attardi

  • MS

    Maria Simi

  • HZ

    Hugo Zaragoza

Links