Back to Main Conference 2018
LREC 2018main

Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/2kaaqqdvhqhv

Abstract

This article describes the creation of corpora with part-of-speech annotations for three regional languages of France: Alsatian, Occitan and Picard. These manual annotations were performed in the context of the RESTAURE project, whose goal is to develop resources and tools for these under-resourced French regional languages. The article presents the tagsets used in the annotation process as well as the resulting annotated corpora.

Details

Paper ID
lrec2018-main-619
Pages
N/A
BibKey
bernhard-etal-2018-corpora
Editors
Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 - 12 May 2018

Authors

  • DB

    Delphine Bernhard

  • AL

    Anne-Laure Ligozat

  • FM

    Fanny Martin

  • MB

    Myriam Bras

  • PM

    Pierre Magistry

  • MV

    Marianne Vergez-Couret

  • LS

    Lucie Steiblé

  • PE

    Pascale Erhart

  • NH

    Nabil Hathout

  • DH

    Dominique Huck

  • CR

    Christophe Rey

  • PR

    Philippe Reynés

  • SR

    Sophie Rosset

  • JS

    Jean Sibille

  • TL

    Thomas Lavergne

Links