
CommonMorph: Participatory Morphological Documentation Platform

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/5gqigwzjjv4b

Abstract

Collecting and annotating morphological data present significant challenges, requiring linguistic expertise, methodological rigour, and substantial resources. These barriers are particularly acute for low-resource languages and varieties. To accelerate this process, we introduce CommonMorph, a comprehensive platform that streamlines morphological data collection through a three-tiered approach: expert linguistic definition, contributor elicitation, and community validation. The platform minimises manual work by incorporating active learning, annotation suggestions, and tools to import and adapt materials from related languages. It accommodates diverse morphological systems, including fusional, agglutinative, and root-and-pattern morphologies. Its open-source design and UniMorph-compatible outputs ensure accessibility and interoperability with NLP tools. The platform is accessible at https://common-morph.com and offers a replicable model for preserving linguistic diversity through collaborative technology.

Details

Paper ID
lrec2026-main-919
Pages
pp. 11735-11746
BibKey
mahmudi-etal-2026-commonmorph
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 - 16 May 2026

Authors

  • Aso Mahmudi
  • Sina Ahmadi
  • Kemal Maulana Kurniawan
  • Rico Sennrich
  • Eduard H. Hovy
  • Ekaterina Vylomova

Links