A Search Tool for Corpora with Positional Tagsets and Ambiguities

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/3xoo9dkau536

Abstract

This article describes POLIQARP, a corpus indexing and query tool, which understands positional tagsets and which does not assume that word forms are annotated with unique morphosyntactic tags. POLIQARP is designed to be applicable to a variety of languages and tagsets: it works with XML-encoded texts, uses the UTF-8 character set, and allows for an external specification of the tagset. Currently, POLIQARP is used for indexing and searching a morphosyntactically annotated corpus of Polish.

Details

Paper ID
lrec2004-main-145
Pages
N/A
BibKey
przepiorkowski-etal-2004-search
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26-28 May 2004

Authors

  • Adam Przepiórkowski
  • Zygmunt Krynicki
  • Łukasz Dębowski
  • Marcin Woliński
  • Daniel Janus
  • Piotr Bański
