HomeLREC 2026WorkshopsNSLPlrec2026-ws-nslp-27
Back to NSLP 2026
LREC 2026workshop

Transformer Encoders with Heuristic-Guided Contrastive Learning for Software Coreference Resolution

Proceedings of Natural Scientific Language Processing (NSLP) @ LREC 2026

DOI:10.63317/4zfmki62nojg

Abstract

This paper describes our system submitted to the Software Mention Detection and Coreference Resolution (SOMD) 2026 shared task, specifically for Subtask 1 (cross-document coreference resolution over gold-standard mentions) and Subtask 2 (cross-document coreference resolution over predicted mentions). The proposed approach employs a SciBERT architecture trained with Supervised Contrastive (SupCon) loss to generate dense mention representations, which are then clustered using Hierarchical Agglomerative Clustering (HAC) with average linkage. Software-aware heuristics are integrated to exploit domain-specific signals such as software name canonicalization and developer disambiguation to adjust pairwise similarity scores before clustering. The system achieved strong performance, with a CoNLL F1 score of 92.18% on coreference resolution over gold-standard mentions and 91.87% on coreference resolution over predicted mentions, showing significant performance of our approach in this area for human annotated and automated systems respectively

Details

Paper ID
lrec2026-ws-nslp-27
Pages
pp. 270-276
BibKey
hassan-etal-2026-transformer
Editors
Georg Rehm, Stefan Dietze, Danilo Dessi, Diana Maynard, Sonja Schimmler
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of Natural Scientific Language Processing (NSLP) @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • MH

    Mahmoud Hassan

  • DY

    Dipendra Yadav

Links