Back to Main Conference 2012
LREC 2012main

An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/4embxkiu4opd

Abstract

This paper describes the ANNODIS resource, a discourse-level annotated corpus for French. The corpus combines two perspectives on discourse: a bottom-up approach and a top-down approach. The bottom-up view incrementally builds a structure from elementary discourse units, while the top-down view focuses on the selective annotation of multi-level discourse structures. The corpus is composed of texts that are diversified with respect to genre, length and type of discursive organisation. The methodology followed here involves an iterative design of annotation guidelines in order to reach satisfactory inter-annotator agreement levels. This allows us to raise a few issues relevant for the comparison of such complex objects as discourse structures. The corpus also serves as a source of empirical evidence for discourse theories. We present here two first analyses taking advantage of this new annotated corpus --one that tested hypotheses on constraints governing discourse structure, and another that studied the variations in composition and signalling of multi-level discourse structures.

Details

Paper ID
lrec2012-main-498
Pages
pp. 2727-2734
BibKey
afantenos-etal-2012-empirical
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • SA

    Stergos Afantenos

  • NA

    Nicholas Asher

  • FB

    Farah Benamara

  • MB

    Myriam Bras

  • CF

    Cécile Fabre

  • MH

    Mai Ho-dac

  • AD

    Anne Le Draoulec

  • PM

    Philippe Muller

  • MP

    Marie-Paule Péry-Woodley

  • LP

    Laurent Prévot

  • JR

    Josette Rebeyrolles

  • LT

    Ludovic Tanguy

  • MV

    Marianne Vergez-Couret

  • LV

    Laure Vieu

Links