TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

Abstract

We present a morphological tagger for Latin, called TTLab Latin Tagger based on Conditional Random Fields (TLT-CRF) which uses a large Latin lexicon. Beyond Part of Speech (PoS), TLT-CRF tags eight inflectional categories of verbs, adjectives or nouns. It utilizes a statistical model based on CRFs together with a rule interpreter that addresses scenarios of sparse training data. We present results of evaluating TLT-CRF to answer the question what can be learnt following the paradigm of 1st order CRFs in conjunction with a large lexical resource and a rule interpreter. Furthermore, we investigate the contigency of representational features and targeted parts of speech to learn about selective features.

Resources

Details

Paper ID

lrec2016-main-240

Pages

pp. 1514-1519

DOI

10.63317/2a7arzf2kows

BibKey

vor-der-bruck-mehler-2016-tlt

Editors

Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis

Publisher

European Language Resources Association (ELRA)

ISSN

2522-2686

ISBN

978-2-9517408-9-1

Conference

Tenth International Conference on Language Resources and Evaluation

Location

Portorož, Slovenia

Date

23 - 28 May 2016

Authors

Tv
Tim vor der Brück
AM
Alexander Mehler

Links

URL

DOI