Back to Main Conference 2004
LREC 2004main

Securing Interpretability: The Case of Ega Language Documentation

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/2anwtx949d29

Abstract

The prime consideration in designing sustainable language resources is to ensure that they remain interpretable for coming generations of users. In this paper we adopt a new perspective on resource creation - securing the interpretability of data, using a case study of Ega, an endangered African language for which a small amount of legacy data is available. Basic ste ps to securing interpretability are to transfer files to durable media, and where possible, to convert all legacy data into XML files with Unicode character encodings. In the absence of agreed `best practice' standards, we propose a methodology of `better practice' to assist in the transition process towards this goal. We discuss a number of issues involved in securing interpretability of the lexicon, character encodings, interlinear glossed text, annotated recordings and nomenclature in linguistic descriptions, and describe our solutions.

Details

Paper ID
lrec2004-main-098
Pages
N/A
BibKey
gibbon-etal-2004-securing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • DG

    Dafydd Gibbon

  • CB

    Catherine Bow

  • SB

    Steven Bird

  • BH

    Baden Hughes

Links