Back to Main Conference 2026
LREC 2026main

Building Multimodal Corpora Using Microtask Pipelines and Local Annotators

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/434uxg6yj2aj

Abstract

Multimodality, or how human communication and interaction combine multiple forms of expression, is studied across diverse fields of research. Many of these fields have underlined the need for large, richly annotated multimodal corpora to support empirical research. While language resources are increasingly annotated using microtask crowdsourcing, multimodal corpora remain largely reliant on expert annotators, which creates a bottleneck for scalability and broad applicability. This paper presents a novel hybrid approach to multimodal corpus annotation, leveraging the efficiency of microtask pipelines while preserving theoretical rigour. Our approach decomposes the annotation process into sequences of simple, well-instructed tasks, which are then performed by locally recruited non-expert annotators. We demonstrate the feasibility of this approach by presenting a pipeline for annotating the multimodal structure of school textbooks.

Details

Paper ID
lrec2026-main-514
Pages
pp. 6483-6495
BibKey
hotti-etal-2026-building
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • HH

    Helmiina Hotti

  • RV

    Raul Vazquez

  • AJ

    Anna-Kaisa Jokipohja

  • TK

    Timo Kalliokoski

  • HP

    Henna Paakki

  • RS

    Rosa Suviranta

  • TH

    Tuomo Hiippala

Links