Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/25r7bfekrhxw

Abstract

In this paper, we report on first attempts at analyzing German patient records, and on the resulting findings, using a hybrid parsing architecture and a combination of two relation extraction strategies. On a practical level, we are interested in extracting concepts and the relations among them, a necessary cornerstone for building medical information systems. The parsing pipeline consists of a morphological analyzer, a robust chunk parser adapted to the Latin phrases used in medical diagnosis, a repair rule stage, and a probabilistic context-free parser that respects the output of the chunker. The relation extraction stage combines two systems: SProUT, a shallow processor that uses hand-written rules to discover relation instances in local text units, and DARE, which extracts relation instances from complete sentences using rules learned in a bootstrapping process that starts from semantic seeds. Two small experiments have been carried out for the parsing pipeline and the relation extraction stage.

Details

Paper ID
lrec2014-main-197
Pages
pp. 2043-2048
BibKey
krieger-etal-2014-information
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26-31 May 2014

Authors

  • Hans-Ulrich Krieger
  • Christian Spurk
  • Hans Uszkoreit
  • Feiyu Xu
  • Yi Zhang
  • Frank Müller
  • Thomas Tolxdorff