LREC 2000 2nd International Conference on Language Resources & Evaluation
 


Title Spoken Portuguese: Geographic and Social Varieties
Authors Bettencourt Gonçalves José (Centro de Linguística da Universidade de Lisboa, Av. 5 de Outubro, 85 - 6º, 1050-050 LISBOA, Portugal, jose.bettencourt@clul.ul.pt)
Veloso Rita (Centro de Linguística da Universidade de Lisboa, Av. 5 de Outubro, 85 - 6º, 1050-050 LISBOA, Portugal, rita.veloso@clul.ul.pt)
Keywords Language Teaching, Listening and Understanding, Portuguese Corpus, Portuguese Varieties, Spoken Portuguese
Session Session SP3 - Spoken Language Resources' Projects
Full Paper 71.ps, 71.pdf
Abstract The main goal of the Spoken Portuguese: Geographic and Social Varieties project is the teaching of Portuguese as a foreign language. The idea is to provide a collection of authentic spoken texts and to make it user-friendly. To that end, a selection of spontaneous oral data was made, using either previously compiled material or material recorded for this purpose. The final corpus is a representative sample that includes European, Brazilian and African Portuguese, as well as the Portuguese of Macau and East Timor. To make the product functional, the Linguistics Center of the University of Lisbon developed sound/text alignment software. The final result is a CD-ROM collection that contains 83 text files, 83 sound files and 83 files produced by the sound/text alignment tool. Because the sound and text files are independent of each other, CD-ROM users can also work with them for purposes other than educational ones.