HomeLREC 2026WorkshopsNSLPlrec2026-ws-nslp-24
Back to NSLP 2026
LREC 2026workshop

The Software Mention Detection and Coreference Resolution Shared Task 2026

Proceedings of Natural Scientific Language Processing (NSLP) @ LREC 2026

DOI:10.63317/3koortigjkfr

Abstract

Software is referenced in research papers in many different ways: full names, abbreviations, misspellings, versioned names, or indirect references via websites and citations. This makes it hard to link mentions to a single software entity, which in turn limits large-scale analyses and knowledge graph construction. The Software Mention Detection and Coreference Resolution (SOMD) shared task 2026, organized at the Natural Scientific Language Processing (NSLP) workshop at LREC 2026, focuses on clustering software mentions that refer to the same software entity. We provide three subtasks covering gold mentions, automatically extracted mentions, and mentions sampled at scale from large-scale publications. Systems are evaluated with established coreference metrics (MUC, B³, CEAFe) and their CoNLL average. This paper describes the task setup, datasets, evaluation, baseline, and the observed patterns in participant submissions, and outlines future directions for scalable software mention coreference resolution. The shared task was concluded with total five registered participants, with total 43 submissions for all subtasks. Finally, two system papers were submitted with competitive performance against baselines.

Details

Paper ID
lrec2026-ws-nslp-24
Pages
pp. 247-254
BibKey
upadhyaya-etal-2026-software
Editors
Georg Rehm, Stefan Dietze, Danilo Dessi, Diana Maynard, Sonja Schimmler
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of Natural Scientific Language Processing (NSLP) @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • SU

    Sharmila Upadhyaya

  • WO

    Wolfgang Otto

  • JM

    Julia Matela

  • FK

    Frank Krüger

  • SD

    Stefan Dietze

Links