
Annohub – Annotation Metadata for Linked Data Applications

Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)

DOI:10.63317/4ncwu8vyo27h

Abstract

We introduce a new dataset for the Linguistic Linked Open Data (LLOD) cloud that provides metadata about annotation and language information harvested from annotated language resources, such as corpora, that are freely available on the internet. To our knowledge, no metadata provider (e.g. linghub, datahub or CLARIN) offers annotation metadata so far. Moreover, the language metadata found on such portals is rarely provided in machine-readable form, and even more rarely as Linked Data. In this paper, we describe the harvesting process, the content and structure of the new dataset, and its application in the Lin|gu|is|tik portal, a research platform for linguists. In addition, we introduce tools for converting XML-encoded language resources to the CoNLL format. The generated RDF data and the XML converter application are made public under an open license.

Details

Paper ID
lrec2020-ws-ldl-06
Pages
pp. 36-44
BibKey
abromeit-etal-2020-annohub
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)
Location
N/A
Date
11-16 May 2020

Authors

  • Frank Abromeit

  • Christian Fäth

  • Luis Glaser
