Back to MATHNLP 2024
LREC-COLING 2024workshop

An Approach to Co-reference Resolution and Formula Grounding for Mathematical Identifiers Using Large Language Models

Proceedings of the 2nd Workshop on Mathematical Natural Language Processing @ LREC-COLING 2024

DOI:10.63317/3pq3nw4vzona

Abstract

This paper outlines an automated approach to annotate mathematical identifiers in scientific papers — a process historically laborious and costly. We employ state-of-the-art LLMs, including GPT-3.5 and GPT-4, and open-source alternatives to generate a dictionary for annotating mathematical identifiers, linking each identifier to its conceivable descriptions and then assigning these definitions to the respective identifier in- stances based on context. Evaluation metrics include the CoNLL score for co-reference cluster quality and semantic correctness of the annotations.

Details

Paper ID
lrec2024-ws-mathnlp-1
Pages
pp. 1-10
BibKey
dev-etal-2024-approach
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd Workshop on Mathematical Natural Language Processing @ LREC-COLING 2024
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • AD

    Aamin Dev

  • TA

    Takuto Asakura

  • RS

    Rune Sætre

Links