Back to Main Conference 2012
LREC 2012main

French and German Corpora for Audience-based Text Type Classification

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/3wqsdxx7fw6r

Abstract

This paper presents some of the results of the CLASSYN project which investigated the classification of text according to audience-related text types. We describe the design principles and the properties of the French and German linguistically annotated corpora that we have created. We report on tools used to collect the data and on the quality of the syntactic annotation. The CLASSYN corpora comprise two text collections to investigate general text types difference between scientific and popular science text on the two domains of medical and computer science.

Details

Paper ID
lrec2012-main-286
Pages
pp. 1591-1597
BibKey
todirascu-etal-2012-french
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • AT

    Amalia Todirascu

  • SP

    Sebastian Padó

  • JK

    Jennifer Krisch

  • MK

    Max Kisselew

  • UH

    Ulrich Heid

Links