Back to Main Conference 2016
LREC 2016main

Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/588pdsd79yq9

Abstract

In order to analyze metrical and semantics aspects of poetry in Spanish with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we will present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation problems, and the evaluation, in which an inter-annotator agreement of 96\% has been obtained. The corpus is open and available.

Details

Paper ID
lrec2016-main-691
Pages
pp. 4360-4364
BibKey
navarro-etal-2016-metrical
Editors
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 - 28 May 2016

Authors

  • BN

    Borja Navarro

  • MR

    María Ribes Lafoz

  • NS

    Noelia Sánchez

Links