Summary of the paper

Title Introducing the Reference Corpus of Contemporary Portuguese Online
Authors Michel Généreux, Iris Hendrickx and Amália Mendes
Abstract We present our work in processing the Reference Corpus of Contemporary Portuguese and its publication online. After discussing how the corpus was built and our choice of meta-data, we turn to the processes and tools involved for the cleaning, preparation and annotation to make the corpus suitable for linguistic inquiries. The Web platform is described, and we show examples of linguistic resources that can be extracted from the platform for use in linguistic studies or in NLP.
Topics Corpus (creation, annotation, etc.), Metadata, Web Services
Full paper Introducing the Reference Corpus of Contemporary Portuguese Online
Bibtex @InProceedings{GNREUX12.309,
  author = {Michel Généreux and Iris Hendrickx and Amália Mendes},
  title = {Introducing the Reference Corpus of Contemporary Portuguese Online},
  booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
 }
Powered by ELDA © 2012 ELDA/ELRA