Back to Main Conference 2014
LREC 2014main

Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/55gbszv9v5qd

Abstract

This paper presents ongoing work that aims to improve machine parsing of Faroese using a combination of Faroese and Icelandic training data. We show that even if we only have a relatively small parsed corpus of one language, namely 53,000 words of Faroese, we can obtain better results by adding information about phrase structure from a closely related language which has a similar syntax. Our experiment uses the Berkeley parser. We demonstrate that the addition of Icelandic data without any other modification to the experimental setup results in an f-measure improvement from 75.44% to 78.05% in Faroese and an improvement in part-of-speech tagging accuracy from 88.86% to 90.40%.

Details

Paper ID
lrec2014-main-661
Pages
pp. 91-95
BibKey
ingason-etal-2014-rapid
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26 May 2014 31 May 2014

Authors

  • AI

    Anton Karl Ingason

  • HL

    Hrafn Loftsson

  • ER

    Eiríkur Rögnvaldsson

  • ES

    Einar Freyr Sigurðsson

  • JW

    Joel C. Wallenberg

Links