Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

Abstract

We present a new large dataset of 12403 context-sensitive verb relations manually annotated via crowdsourcing. These relations capture fine-grained semantic information between verb-centric propositions, such as temporal or entailment relations. We propose a novel semantic verb relation scheme and design a multi-step annotation approach for scaling-up the annotations using crowdsourcing. We employ several quality measures and report on agreement scores. The resulting dataset is available under a permissive CreativeCommons license at www.ukp.tu-darmstadt.de/data/verb-relations/. It represents a valuable resource for various applications, such as automatic information consolidation or automatic summarization.

Resources

Details

Paper ID

lrec2016-main-338

Pages

pp. 2131-2137

DOI

10.63317/2ej5dkbwikp6

BibKey

sukhareva-etal-2016-crowdsourcing

Editors

Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis

Publisher

European Language Resources Association (ELRA)

ISSN

2522-2686

ISBN

978-2-9517408-9-1

Conference

Tenth International Conference on Language Resources and Evaluation

Location

Portorož, Slovenia

Date

23 - 28 May 2016

Authors

MS
Maria Sukhareva
JE
Judith Eckle-Kohler
IH
Ivan Habernal
IG
Iryna Gurevych

Links

URL

DOI