Back to Main Conference 2004
LREC 2004main

Creating Slovenian Language Resources for Development of Speech-to-speech Translation Components

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/5jzk6skgvgvw

Abstract

Article brings detailed information about procedures of building Slovenian lexica within the LC-STAR project, and also detailed information about the size of that lexica. University of Maribor joined the LC-STAR project in order to provide appropriate language resources for developing speech-to-speech translation technology for Slovenian language. Lexica exists from three parts: 65.000 common words, 45.000 proper names and 6.000 special application domain words. All lexica will be morpho-syntactically tagged and phonetically transcribed. Quality of produced language resources is ensured by independent validation.

Details

Paper ID
lrec2004-main-045
Pages
N/A
BibKey
verdonik-etal-2004-creating
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • DV

    Darinka Verdonik

  • MR

    Matej Rojc

  • ZK

    Zdravko Kačič

Links