Back to Main Conference 2000
LREC 2000main

A Unified POS Tagging Architecture and its Application to Greek

Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000)

DOI:10.63317/35m5snmp58vy

Abstract

This paper proposes a flexible and unified tagging architecture that could be incorporated into a number of applications like information extraction, cross-language information retrieval, term extraction, or summarization, while providing an essential component for subsequent syntactic processing or lexicographical work. A feature-based multi-tiered approach (FBT tagger) is introduced to part-of-speech tagging. FBT is a variant of the well-known transformation based learning paradigm aiming at improving the quality of tagging highly inflective languages such as Greek. Additionally, a large experiment concerning the Greek language is conducted and results are presented for a variety of text genres, including financial reports, newswires, press releases and technical manuals. Finally, the adopted evaluation methodology is discussed.

Details

Paper ID
lrec2000-main-135
Pages
N/A
BibKey
papageorgiou-etal-2000-unified
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Second International Conference on Language Resources and Evaluation
Location
Athens, Greece
Date
31 May 2000 2 June 2000

Authors

  • HP

    Harris Papageorgiou

  • PP

    Prokopis Prokopidis

  • VG

    Voula Giouli

  • SP

    Stelios Piperidis

Links