Back to Main Conference 2000
LREC 2000main
A Robust Parser for Unrestricted Greek Text
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000)
Abstract
In this paper we describe a method for the efficient parsing of real-life Greek texts at the surface syntactic level. A grammar consisting of non-recursive regular expressions describing Greek phrase structure has been compiled into a cascade of finite state transducers used to recognize syntactic constituents. The implemented parser lends itself to applications where large scale text processing is involved, and fast, robust, and relatively accurate syntactic analysis is necessary. The parser has been evaluated against a ca 34000 word corpus of financial and news texts and achieved promising precision and recall scores.