Back to Main Conference 2014
LREC 2014main
Zmorge: A German Morphological Lexicon Extracted from Wiktionary
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)
Abstract
We describe a method to automatically extract a German lexicon from Wiktionary that is compatible with the finite-state morphological grammar SMOR. The main advantage of the resulting lexicon over existing lexica for SMOR is that it is open and permissively licensed. A recall-oriented evaluation shows that a morphological analyser built with our lexicon has comparable coverage compared to existing lexica, and continues to improve as Wiktionary grows. We also describe modifications to the SMOR grammar that result in a more conventional lemmatisation of words.