Request Correction
Use this form to request corrections to the paper metadata. Select the fields that need correction and provide the correct information.
Correction Guidelines
- Click the edit button next to a field to report a correction.
- Fill in the suggested correction value for each field you want to correct.
- Provide your name and email so we can contact you if needed.
Paper Information
Automated Morphological Segmentation and Evaluation
Paper Fields
Click the edit button next to a field to report a correction.
Automated Morphological Segmentation and Evaluation
In this paper we introduce (i) a new method for morphological segmentation of German words and (ii) some measures related to the MDL principle for evaluation of morphological segmentations. Our segmentation method is based on general knowledge about inflection, derivation, and morphotactics, and part of speech information, all supplied by little effort. It includes the capabilities to generate allomorphs, to deal with hierarchical structure, and to retrieve morphemes not given in isolation in the input data. Manual evaluation of 1400 segmented types, counting omissions and false insertions of morpheme boundaries, gave 87 % recall and 98 % precision. In order to get automatic evaluation measures for morphological segmentations, we tested (i) vocabulary size and entropy measures (data size aspect of the MDL principle), (ii) model size represented as the number of states of reduced deterministic finite state automatons (DFSA) matching exactly the models' outputs, and (iii) a linear combination of (i) and (ii). These measures have been applied to segmentations of different qualities. As a result linear combination of vocabulary size and size of model-equivalent reduced DFSAs turned out to be an appropriate measure to rank segmentation models according to their quality.
Authors
Expand an author to correct their information. Use the remove button to request author removal, or add a new author.
PDF Attachment
You may attach a PDF as a corrected version of the paper. Max file size: 10MB. Only PDF files are accepted.
Your Information
Author Declaration *
Select at least one field to correct using the edit buttons above.