HomeLREC 2026WorkshopsRESOURCEFULlrec2026-ws-resourceful-07
Back to RESOURCEFUL 2026
LREC 2026workshop

LLMs as Assistants for Data Annotation: Addressing Disagreement and Supporting Expert Processes

The Fourth Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL 2026)

DOI:10.63317/4gq9fnt5umo3

Abstract

This paper investigates the potential of Large Language Models to assist human annotation pipelines, with a particular focus on supporting the development of expert-informed annotation guidelines for document-level content categorisation. We present three experiments exploring distinct roles for LLMs in annotation: as annotators, as domain experts assisting in disagreement resolution, and as analysts of annotator discussions. Using GPT-4.5 and Claude Sonnet 4, we evaluate LLM-generated annotation guidelines for a document-level classification tasks in terms of coverage, applicability, and usefulness. Preliminary results are mixed-to-positive, with evidence that LLMs can provide useful support across different stages of the annotation pipeline, particularly when supplied with rich contextual information such as prior human annotations and annotator discussions. However, their effectiveness remains sensitive to prompting strategies and input configuration.

Details

Paper ID
lrec2026-ws-resourceful-07
Pages
pp. 62-72
BibKey
andrade-etal-2026-llms
Editors
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
The Fourth Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL 2026)
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • MA

    Mark Andrade

  • BH

    Bláithín Heffernan

  • AW

    Abigail Walsh

  • SC

    Sheila Castilho

Links