Back to Main Conference 2012
LREC 2012main

Texto4Science: a Quebec French Database of Annotated Short Text Messages

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/24i436sxdny8

Abstract

In October 2009, was launched the Quebec French part of the international sms4science project, called texto4science. Over a period of 10 months, we collected slightly more than 7000 SMSs that we carefully annotated. This database is now ready to be used by the community. The purpose of this article is to relate the efforts put into designing this database and provide some data analysis of the main linguistic phenomenon that we have annotated. We also report on a socio-linguistic survey we conducted within the project.

Details

Paper ID
lrec2012-main-214
Pages
pp. 1047-1054
BibKey
langlais-etal-2012-texto4science
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • PL

    Philippe Langlais

  • PD

    Patrick Drouin

  • AP

    Amélie Paulus

  • EB

    Eugénie Rompré Brodeur

  • FC

    Florent Cottin

Links