Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/25hhw4ac87kw

Abstract

An important problem when using Stochastic Inversion Transduction Grammars is their computational cost. More specifically, when dealing with corpora such as Europarl, even a single iteration of the estimation algorithm becomes prohibitive. In this work, we reduce this cost by taking advantage of the bracketing information in parsed corpora and show machine translation results obtained with a bracketed Europarl corpus, yielding interesting improvements when increasing the number of non-terminal symbols.

Details

Paper ID
lrec2008-main-113
Pages
N/A
BibKey
sanchis-sanchez-2008-using
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28 May 2008 - 30 May 2008

Authors

  • Germán Sanchis

  • Joan Andreu Sánchez

Links